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EXPLORING EIDETIC IMAGERY AMONG THE KAMBA 
OF CENTRAL KENYA* 


African Studies, Yale University 


Lronarp W. Doos! 


A. INTRODUCTION 


For many years it has been stated that eidetic images, defined in their visual 
form as images “persisting after stimulation, relatively accurate in detail, 
colored positively, and capable of being scanned” (6), are very rare among 
adults in Western society (e.g., 8, p. 651). Recently, however, the incidence 
of this type of imagery has been found to be significantly higher in a sample 
of Ibo in Eastern Nigeria (West Africa), especially among those living in a 
rural area (5). In the exploratory research to be reported, the author has 
sought tentative answers to five questions: (а) Does eidetic imagery occur 
equally frequently in another African society? i.e, what is its incidence? 
(2) If it does occur, what are its distinctive attributes? (c) Does it facili- 
tate accuracy in reporting immediately after perception and thereafter? (4) 
With what psychological or social factors is it associated? i.e., what are its 
relationshibs? and (e) What functions does it perform from the viewpoint 
of people who possess it? i.e., what is its ethnopsychology? 

Whenever possible, promising new leads were pursued, the most pressing 
of which suggested the possibility of another kind of imagery. 

Since neither theory nor knowledge concerning eidetic images can indicate, 
alas, the kind of society in which they are likely to flourish, the investigator 
selected the Kamba for the fortuitous reason that he happened to be in East 
Africa for another purpose in the summer of 1964, and this tribe was close at 
hand. The Kamba (sometimes the plural form, Wakamba, is used to designate 
them) are a Bantu group numbering approximately 600,000 and living in 
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central Kenya. In culture and in language they are closely related to the better- 
known Kikuyu. The reservation on which the research was conducted is a 
flat, rather arid plain beginning about 20 miles northeast of Nairobi and 
having an altitude close to 6,000 feet above sea level. The Ss tested in the 
study live at least another 20 miles further east. The round adobe houses of 
the Kamba are fairly close together. The principal road is unpaved, reasonably 
good, usually very bumpy and, during the dry season, most unpleasantly dusty. 
Markets are held twice a week in small centers which have shops and other 
public services. There is no electricity in the area. 

Three relevant characteristics of the Kamba language must be noted. Like 
many Bantu languages, it is deficient with respect to color terms; consequently, 
informants experienced some difficulty in naming colors and often used 
English terms that have entered their language. The distinction between 
present and imperfect tenses is as clear-cut as in English, although the two 
tenses may be used carelessly because they are often distinguished only by a 
single phoneme. Finally, the phrase, “picture in my head,” which plays an 
important role in this study, can be translated literally from English; in fact 
the word for “picture” is generally one derived from the English term. 


B. THE SAMPLE 


Since a random sample of informants was neither necessary nor possible, 
the investigator and his interpreter simply approached a dwelling usually 
located not too far from the main road and asked the first adult they met 
whether he would be willing to look at some pictures and be questioned by 
"a professor from America." Men were generally sought because women either 
claimed to be occupied with household chores or were frequently distracted 
by young children; in some instances adolescents became the Ss. The very 
few people who refused seemed not suspicious nor unfriendly but in fact busy; 
some declined to be reinterviewed probably because their secret hope of being 
paid the first time had not been realized. 

A total of 49 interviews was completed: 20 females, 29 males (six per 
cent more females than in the Ibo study) ; ages, not very reliably reported, 
ranged from 14 to 70, with a mean of 27.8 years (25.7 for the Ibo sample). 
Not all the findings, however, are based upon the total sample as the contents 
of the schedule were deliberately varied in the course of the investigation to 
adapt the procedure to the minor crises that arose and, more importantly, to 
take advantage of new information. For this reason, whenever the number 
of cases is not 49, the number giving rise to a statement is supplied in 
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parentheses, sometimes as part of a ratio indicating the proportion of Ss 
conforming to the indicated tendency. 


С. FIELD CONDITIONS 


With two exceptions, all interviewing and testing took place out-of-doors 
and, whenever possible, in the shade. The materials were placed upon a folding 
table, and $ was provided with a folding chair. $ sat directly across the 
table from the investigator, so that his behavior (especially his eye move- 
ments) could be closely observed. The interpreter was seated at one of the 
other sides of the table. Almost always, at least a half-dozen spectators of 
all ages stood around to watch and listen; infants, crowing chickens, placid 
dogs, and sometimes cattle also intruded. The general atmosphere, in short, 
was jolly, cordial, and incredibly noisy; most Ss, however, seemed able to 
concentrate upon the tasks at hand, probably because they were accustomed to 
living under these conditions. 

Virtually all interviews were conducted in Kamba through the interpreter, 
but in a few instances $ was addressed in English and replied in English. 
Verbatim notes were taken in English. The last eight interviews and all the 
follow-up interviews were also recorded on tape. 

The field work progressed slowly. Living on the reservation was not 
practical; hence it was necessary to drive daily at least 90 miles to reach the 
site and return to Nairobi. A good deal of time was expended in procuring 
Ss and in responding to their endearing rites of courtesy and hospitality. The 
interviewing and testing of each S took at least an hour. Not more than four 
schedules were ever completed on a single day after nine hours of driving, 
searching, and testing. In the course of a month less than one-third of the 


49 Ss could be located and tested a second time. 
D. PROCEDURE 


All Ss were tested for eidetic imagery in a standardized manner adopted 
from Haber and Haber (6) and also utilized in the Ibo study. After being 
given the first half of a visual retention test, $ faced a blank piece of card- 
board on a small stand. The classical procedure was then followed. To ac- 
custom $ to seeing images on that screen, $ was shown а red square for 10 
seconds; then he was questioned about his after-image, which almost always 
appeared and turned out to be negative rather than positive. The training 


was repeated with a black square. Next, a drawing or photograph—hereafter 


referred to as pictures—was exposed for 30 seconds after the $ had been 
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instructed to scan the entire stimulus. $ reported immediately afterwards 
what he saw on the screen. If he could see nothing, he was asked to recall 
what he could from memory until he had supplied enough details either 
voluntarily or upon request to indicate with certainty that he had perceived 
the picture in the first place. 

The first 24 Ss viewed a series of six pictures. At the outset these were 
four drawings and silhouettes from the Haber and Haber series and two 
photographs of definitely African scenes. After the seventh S this series 
was changed to include two pictures from Haber and Haber, two African 
scenes, and two complicated montages of various objects: animals, numbers, 
and letters. 

At this point the procedure was drastically changed because most of the 
Ss had been reporting that ordinarily they did not see “pictures” in front of 
their eyes or themselves—rather they referred just to "pictures" without 
locating them or, more usually, to “pictures in my head." The remaining 25 
Ss were first tested in the same way for eidetic images, except that the number 
of pictures in the series was reduced to four. The second half of Benton's 
test (3) followed, after which every $ but one was shown a second series 
of four pictures and questioned as to whether or not he was seeing a "picture 
in your head." The two series were presented in alternate order from $ to 8; 
each series contained two pictures from Haber and Haber, one African 
scene, and one montage. 

Since the traditional procedure eliciting eidetic images almost always makes 
use of a screen, those images elicted in the absence of the screen will here- 
after be called pictorial images (PI) to distinguish them from eidetic images 
(EI). For reasons to be indicated subsequently, the pictorial images (PI) 
are considered to be different from though related to what the literature 
terms memory images (MI). An S seeing nothing on the screen or reporting 
no “picture” in his head will be said to have had no image (OI). 

As in the Ibo study, each response to a picture was classified as E (‘“. . , 
correct minute details .... [S] continued to scan the screen . . . as he reported 
that the image was fading or had faded, he changed tenses from the present 
to the past”); E- (*. . . the details were less numerous or the duration too 
short to permit a full report by [S] in the present tense") i or © (7... m 
trace . . . or a flat statement to the effect that ‘I see nothing! "), A similar 
classification was adopted when Ss were asked to report whether or not 
they had picture imagery (PI), except that the criterion of duration could 
not be employed: P, P-, and O. Reliability was high—over 90 per cent—as 
determined in two ways. First, the investigator and the interpreter generally 
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whispered their judgments to each other as $ was being exposed to the next 
picture. Then a sample of the recordings on tape was classified by two in- 
dependent judges. 

As in the Ibo study, each $ was assigned to one of four EI groups and 
for the Ss viewing the two series of pictures, also to one of four PI йир 
Criteria for assignment were slightly different from the Ibo study since not 
seven but either four or six pictures were used: (а) All E (or P)—no Os, 
no minus; (b) Mostly E (or P)—no Os and no more than one minus (if 
four) or two minus (if six) ; (c) Some E (or P)—no more than one or two 
Os (if four), one to four Os (if six) ; and (4) No E (or P)—three or four 
Os (if four), five or six Os (if six). 


E. RESULTS 
1. Incidence 


The imagery groups in which Ss were classified as well as comparable data 
from the Ibo research are given in Table 1. Virtually the same percentage 
of Ss in both studies, 20 per cent, were found to be All E. The total having 
All E, Mostly E, or Some E among the Ibo was 53 per cent, whereas the 


TABLE 1 
CLASSIFICATION OF SS 
Ib Kamba 
Category E E 
All E or P 9 10 12 
Mostly E or P 6 2 7 
Some E or P 9 7 3 
No E or P 21 30 2 
Total 45 49 24 


er cent. The proportion of Kamba with PI 
4 such images at least once (22 out of 
24). As in the Ibo study, there is no statistically significant difference between 
the various pictures with respect to the frequency of EI elicited. One picture 
of an African scene, preferred by more Ss than any other, produced 43 per 
cent EI defined as the sum of E and E— (16 out of 37) ; the corresponding 
figure for a non-African scene was 41 per cent (15 out of 36). 


figure among the Kamba was 39 p 
was very high: almost all Ss reporte 


2. Attributes 


Ss reporting either EI or PI behaved differently from the way they did in 


the absence of such images: 


1. EI and PI were described in the present tense; OI, in the past tense. 


8 JOURNAL OF SOCIAL PSYCHOLOGY 


Occasionally an $ with PI but never one with EI slipped into the past tense. 
When questioned concerning the shift, he immediately moved back into the 
present. 

2. The eyes of an 8 with OI might wander in all directions while being 
questioned, but usually he looked directly at the interpreter or the investigator. 
In contrast, an $ with EI concentrated upon the blank screen until the time 
he reported that the image had disappeared; and often spontaneously and 
always after prompting he would point to an approximately correct portion of 
that screen in order to locate one detail in relation to another. The modal 
behavior of Ss with PI seemed to be to avoid fixating upon a person and in- 
stead to stare blankly straight ahead and in some instances to close their eyes. 

3. Ss with EI or PI were always willing and able to answer the question, 
“Is what you see on the screen [in your head] very clear?" The same question 
made no sense to Ss reporting О]. Not typically but from time to time, a 
few Ss stated that their images were “just as clear" as the original pictures, 
When an $ could not supply the requested information, he might explain the 
deficiency in terms of the image he claimed to be experiencing. Concerning 
his PI, a young man said, “It is not appearing very well.” A young woman 
commented on two PI she had been asked to observe simultaneously: “I see 
them but I cannot locate everything that is there.” The sister of the § just 
mentioned stated that images derived from the same two pictures were 
“colliding.” f 

Aside from their respective external and internal locations, EI differed 
from PI in two respects: 


1. A majority of the Ss themselves distinguisked between the kinds of 
imagery. They reported no EI during the first series of pictures, but then 
claimed to have only PI or at least some PI during the second series (15 out 
of 24). Seven of the remaining Ss had both EI and PI, and a pair had no 
imagery; but no $ had EI without also having PI. 

2. With no exceptions, the duration of PI was significantly greater than 
EI. Save for a single instance, all EI eventually disappeared. Eidetic images 
remained from 40 to 198 seconds, with a mean of 111 seconds (N17), 
The exception was a 15-year-old girl whose images, she stated, lasted as long 
as she wished to retain them. She was questioned for six minutes concerning 
one of the pictures that she claimed to be "seeing" as vividly at the end as 
at the outset. In contrast, with three exceptions, PI seemed never to disappear. 
Ss asserted that, after two to five minutes (when the investigator arbitrarily 
turned to the next picture or task), the images were just as clear or almost 
as clear as they had been immediately after exposure. In the case of one $, 
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questions about the PI were posed for eight minutes—in the case of another 
for 12 minutes—and still, the S maintained that the images persisted. Only 
one of the three exceptions stated that all PI disappeared (mean = 118 
seconds). The other two Ss reported that, except for one picture each, their 
other PI were either persisting or becoming unclear. 


3. Accuracy 


Although individual Ss with EI or PI displayed impressive feats of memory, 
which anecdotally might be mistaken for minor miracles, careful and varied 
analyses reveal that type of imagery had virtually no effect upon the accuracy 
of report in the absence of the pictures. Recall was measured at three intervals: 

a. Immediate recall. After seeing each picture, as indicated earlier, S was 
questioned about his EI, PI, or OI. 

«b. Short-time recall. After being given (during the initial interview) 
a series of tests and questions that lasted from 14 to 66 minutes, $$ were 
again asked to look at the blank screen and report whether or not they saw 
any of the pictures thereon (N — 46). Next they named the picture they 
liked best. Then they were told to recall whatever pictures they could, either 
On the screen or as pictures in their heads; and finally, after spontaneous 
recall had been exhausted, eight Ss were asked whether or not “you can see 
on the screen or as a picture in your head the one in which . . ."—and there 
followed a brief description of the picture, roughly in the words used by the 
$ immediately after exposure. 

c. Long-time recall. From six days to over three weeks after the first 
interview, with the mean elapsed time being slightly under two weeks, 17 
Ss were located and interviewed a second time. First, they were given an 
irrelevant icebreaker concerning daydreaming. Then they were shown the 
same blank screen and asked to report whether or not "you can see on it any 
of the pictures we showed you when we were here last time." In the absence 
of EI (or after the appearance of the last EI), they were questioned concerning 
Possible PI; next they tried to reply to the simple question, “which ones can 
you remember?" 'These invitations to spontaneous recall were followed by 
Prompting inquiries about the remaining pictures. 


Two modes of comparison and various measures of recall were employed 


in connection with the three types of recall. For immediate and short-time 


recall, the reference point was S: nineteen Ss with EI (ie., the sum of 
All E, Mostly E, and Some E) vs. 30 with no EI and seven 8s with both EI 
and PI vs. 15 Ss with PI but with no EI. The Ns were smaller for short-time 
recall because the section of the interview containing this test was omitted in 
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some instances. Then recall data were also broken down in terms of the 
three categories of images, and a picture-by-picture analysis was made. 

Measures of recall were those that took into account the fact that the 
amount of recall was influenced by the number of questions that were asked. 
Up to a point, additional information could almost always be obtained by 
further inquiries; also the number of questions varied, inasmuch as Ss provided 
information with varying degrees of agility. Two of the measures were 
ratios. One ratio shows the relation between the number of content items 
(or colors) correctly reported and the number of appropriate questions posed 
by the investigator. In addition, two specific details were singled out for each 
picture (e.g., the number of branches on a tree, the number of men in a 
group having hats on their heads), and note was made of the correctness of 
the information supplied either spontaneously or after prompting. Appropriate- 
ness and feasibility (especially the number of cases available) determined 
which of the measures was used to measure a given type of recall. For short- 
time and long-time recall the number of pictures recalled spontaneously or 
after prompting was used as another measure. 

Although the findings largely were negative, certain observations and 
modal tendencies deserve brief mention. For immediate recall the mean number 
of correct items recalled per inquiry was calculated for the three kinds of 
imagery evoked by each picture. The absolute differences between those means 
were extremely small; hence statistically are nonsignificant, yet (without 
exception) the mean for OI was always greater than the one for either EI 
or PI (2 < .01, sign test). In addition, fewer Ss with EI than those with 
OI spontaneously referred to specific details concerning the pictures (№ < .01, 
sign test) ; but this trend does not appear when comparisons are made between 


those with PI and OI or between Ss with either EI or PI and Ss with OI 
under conditions of prompting or of the 


¢ combination of spontaneity and 
prompting. Most Ss correctly recalled color 


s, although those not acquainted 
lty expressing themselves; hence 


The ability of Ss to recall minute information w 
means of four pictures containing numbers or letters. Obviously only those 
claiming sufficient literacy were questioned. The two cars in a crowded 
African scene had license plates with letters and numbers clearly visible. 
Among eleven Ss with EI, only one recalled both series perfectly; two, 
almost so. The corresponding figures for five Ss with PI were two and one; 
and for nine Ss with OI, zero and one. Included among five objects in a 


as most easily tested by 
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montage were 11 random numbers preceded and followed by zeros. Nobody 
recalled these numbers without some error (usually a gross one). The best 
performance among five Ss with EI was recalling correctly the first five 
numbers; among seven Ss with PI, the first three numbers and a pair in the 
middle; and among 10 Ss with ОТ, the first three and the last two numbers. 
Another montage consisted (а) of four random letters at the top and another 
four at the bottom of the various stimuli and (b) of four numbers under the 
figure of a buffalo. Nobody in any of the three imagery groups could correctly 
recall both sets of letters and numbers (N = 21), but a sprinkling in each 
group reported only one of the series correctly. Finally, a dozen random 
numbers (N = 6) or colors (N = 4 illiterates) were exposed to Ss with PIs. 
Nobody could repeat correctly all or even most of the numbers or colors 
either backwards or forwards. 

The accuracy of short-time recall was unrelated not only to the imagery 
in immediate recall but also both to the imagery during that recall and to the 
order in which the pictures originally had been presented. The unimportance 
of the latter factor is shown by the Ss’ favorite picture (a scene in an 
African town: two cars, a large bus, many people) that spontaneously was 
recalled equally well whether it had appeared in the first or the second series 
during exposure. Almost all Ss recalled spontaneously at least one picture 
(43 out of 46). The recall, whether spontaneous or prompted, took various 
forms. Ten Ss reported EI at least once, eleven reported PI at least once, and 
the remainder exclusively OI. Five $s had EI of pictures which had not 
evoked such images immediately after seeing them: three previously had 
claimed PI; and two, OI. | 

During the second interview when long-time recall was tested, every $ 
recalled at least one picture spontaneously with some degree of accuracy 
(N = 17). For one, more, or all of the pictures, seven of the Ss reported that 


recall was based upon scanning EI. Four of them previously had indicated they 


were seeing EI immediately after perception, and two and one had indicated, 


respectively, PI and OI. 
Long-time recall was associated, not unexpectedly, j a 
to imagery: viz., reinforcement during the original interview. Whenever a 


Preference for a picture had been expressed during that peser la picture 
Was spontaneously recalled in the second interview (12 out of 12). Almost 
Without exception, $s who had recalled pictures later in the first interview also 
recalled them in the second one (12 out of 14) ; yet the same proportion of 
the same Ss also recalled one or more pictures that they had not mentioned 


as part of their short-time recall in the first interview. 


with a factor unrelated 
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Identical tendencies appeared with reference to details recalled from the 
pictures. Most but not all the material previously had been reported im- 
mediately after perception or as part of short-time recall. Of the four 
pictures containing numbers or letters, only one evoked more than fragmentary 
recall: the African scene containing two cars with license plates. Four Ss 
were involved: one with PI after perception recalled both plates correctly, 
one with EI recalled only one plate correctly; yet another with EI recalled 
only a fragment of one plate, and one with OI reported fragments from both 
plates. All four Ss recalled, whenever they did recall something, in precisely 
the same way as they had reacted previously immediately after seeing the 
pictures, except that in the case of one $ with EI the second license combina- 
tion had dropped out. In contrast, the $ who had reported OI immediately 
after exposure and who had not been questioned concerning possible PI re- 
called as PI in the later interview a number of items he had ло? originally 
noted. He stated that the mouth of the fish was “closed” without having made 
any previous reference to that mouth. He named the correct number of 
branches on a tree when, previously, he stated he could not remember the 
number. Another § had incorrectly reported a girl’s hair to be “reddish” 
immediately after viewing it. Twelve days later, he noted correctly in his 
PI that it was “yellow.” 


4. Relationships 


Relationships between Ss with and without PIs could not be analyzed 
because too few fell into the latter category; so three series of comparison 


have been made between those having 41 E or Mostly E and those having 
Some E or No E: 


4. Performance on various tests. Six tests revealed no significant dif- 
ferences. They are as follows. 


(1). Sorting test. A simple sorting test in which S is instructed to place 
16 pieces of cardboard (differing with respect to shape, size, color, and 
insignia) into as many piles as he wishes, so that the pieces in each pile have 
“something in common” (4, pp. 316-17). There was no difference in the 
accuracy or the type of performance. Ss with EI did not have 
use color alone or in combination with some other attribute as 
principle (N = 46). 

(2). Benton’s Test of Visual Retention, Multiple-Choice Form (3). 
$ looks at a geometrical figure for 10 seconds and then, after its removal, 


a tendency to 
the organizing 
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tries to find it among four similar drawings, only one of which is the correct 
one (N = 37). 

(3). An olfactory test. At random $ is offered one spice to smell and is 
then told to identify it among three similar spices all of which are presented 
in identical containers. Two trials are given (N = 14). Later in the inter- 
view, 10 Ss were asked again to locate the same two scents. The tests were 
uniformly too easy during the initial presentation and too difficult for later 
recall, 

(4). Repetition test. The ability to repeat immediately afterwards 
(N = 12) or 10 minutes later (N = 7) some simple, short sentences in 
German, a language unknown to the Ss. 

(5). Color-memory test. Two series of eight colored crayons were 
alternately exposed for 15 seconds, they were shuffled, and $ was asked to 
restore them in the same sequence in which they had been shown him 
(N = 14). One series contained colors that are easy and the other those 
difficult to name in the Kamba language. 

(6). Thematic-apperception test. The quantity and quality of responses 
to a verbal TAT (N —11) or to TAT-type instructions concerning two 


small rubber figures (N = 5). 


b. Census-type information. None of the following were related to the 
Possession or nonpossession of EI: age (some Ss, like Africans in many socie- 
ties, were unsure of or could not supply this datum concerning themselves) ; 
sex; religion; age at which Ss, if literate, believed (or guessed ) they had been 
taught to read and write; unreliable, stereotyped information supplied by the 
Ss themselves concerning their contact with the mass media of Kenya; length 
of time spent in the city of Nairobi, the very metropolitan capital of Kenya. 
Of the nine Ss who stated they had never been in Nairobi two had strong 
EI; two, weak EI; and five, none at all. On the other hand, when those 
Ss having any kind of EI are compared with those having no EI, one finds 
an almost significantly larger number of the former to know no English or 
virtually none and to have had either no schooling or less than three years 
(.10 > 5 > .05, 72, two-tail test). 

c. Claims concerning other kinds of imagery. No 
reference to each of the following: 


(1). Modalities appearing in dreams. Mos 
colors (33 out of 37), sounds (24 out of 30), smells 


tastes (24 out of 29) in their dreams. 


differences existed with 


t Ss claimed to experience 
(21 out of 35), and 
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(2). Material gained through other modalities that might be stored as 
PI. Most Ss stated that they had such images concerning sounds (32 out of 
38), smells (27 out of 36), and tastes (21 out of 36). 

(3). The part of the body in which S believed his "self" to be 
(N = 22). Most Ss mentioned the whole body or “every part of me” 
(М = 10) ; brain, mind, head (N = 9) ; and heart (N = 8). 


5. Ethnopsychology 


Among Ss with EI, no one claimed ever to have had the experience of 
seeing images on a blank surface (such as the testing screen) or just in front 
of their eyes. In fact, the kind of surface used is not available to Ss in a 
traditional village because they live in huts having walls relatively dark and 
certainly irregular. Projecting images on to a screen seemed “unnatural.” 
Most of those questioned about the fact that they had reported EI upon the 
blank screen said they were surprised to see the “pictures” there (10 out of 12). 
Unless otherwise indicated, the ethnopsychological material presented in this 
section refers only to PI, which the investigator usually called "pictures" 
while interviewing the first 24 Ss and always called “pictures in your head” 
thereafter and in all the reinterviews, 

All Ss (including those reporting only OI and not tested for PI), when 
asked directly about EI, pictures, or PI, answered questions freely and 
easily about the nature of the “pictures” (N = 43). Most maintained that 
“generally,” “ordinarily,” or “frequently” they experienced PI (25 out of 
29). In fact, among the few Ss who could be questioned, all claimed to have 
scen one or more of the investigator’s drawings or photographs between the 
first and second interviews (N = 9). On eleven occasions, at the very end 
of the interview, the villagers who had been watching the $ were asked en 
masse or one by one whether they themselves also had PI. With only one 
exception, the majority of those present laid claim to having the images. They 
replied as seriously as they did concerning any other factual matter, Possibly 
the number of cases having some form of PI might be increased from 22 
(see Table 1) to about 56, and those not having them from two to about six. 

Every $ except one (N = 38) unhesitatingly stated that PI were useful, 
Most stressed some general function, such as helping one remember events 
or people's faces from the past (N = 16) ; seven believed the pictures made 
them feel happy, and five believed they helped with school work. “I know 
what it looks like when I am here," an $ said of a street in Nairobi which he 
claimed he was viewing in his PI. Some Ss stated that PI possessed movement: 
for example, in recalling a traditional Kamba dance (four out of six). The rest 
claimed that PIs were always stationary. Almost all Ss reported, when 


| 


| 
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queried directly, that they saw such pictures at night just before going to 
sleep (33 out of 45). A handful, while being interviewed concerning PI in 
general, slipped easily and apparently unwittingly into a discussion of their 
dreams. A small number stated that they could insert themselves into any 
PI (seven out of 13): for example, each of two men (when provoked by 
questions) could "see" himself besides the figure of the man fishing from the 
canoe in his PI of one of the pictures. It is perhaps significant, since tradi- 
tionally the Kamba attributed great power to ancestral spirits (10, pp. 177- 
205), that the majority stated, when questioned, that PI enabled them—for 
years and years, some maintained—to recall dead persons as they had been 
at the time of death (19 out of 23) ; but six Ss could also recall dead people 
as they had looked earlier in life. After one $ announced that he knew the 
investigator would be returning to interview him that day because a PI had 
so informed him, other Ss were asked whether PI also helped them anticipate 
the future. A majority (six out of 10) were convinced that PI performed that 
function and mentioned specific instances; but the rest had no faith or only 
Partial faith in them. Ss tended to plead ignorance concerning whether or 
not “other people have pictures in their heads the way you do” (eight out of 
10). 
The majority of Ss stated that sometimes PI appeared spontaneously and 
at other times were voluntarily evoked (eight out of 15). The remainder 
claimed either mode but not both of them. In fact, all Ss indicating EI or 
PI after the original perception and a few claiming only OI, reported 
or after prompting) the reappearance or the first 
as part of short-time or long-time recall. 
' although of course 


(either spontaneously 
appearance of at least one EI or PI 
In this sense, the arousal of the images was “voluntary,” 
it was cued by the investigator's instructions. The images spontaneously 
recalled after the instruction, “Which of the pictures we showed you can 
you now see on the screen [in your head] ?” were reported to be present 
almost instantly. In a few cases, however, Ss paused for 20 seconds or so before 
Providing a verbal description. “Let me think,” one $ said concerning a 


Picture that originally had produced an EI 13 days before; then he became 


silent while apparently concentrating ; finally he emerged with a PI. 


It is possible that Ss stressed the importance of PI since they knew of the 
investigator’s interest in the subject. With an open-ended question concerning 
how $ remembered a friend in his absence (N = 41), only a small number 
mentioned PI voluntarily (N = 9); but a large group (N = 29) also 
referred to those images when asked directly whether or not they utilized PI 


for that purpose. 
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One formal attempt and a number of informal attempts were made to 
check the validity of the belief in the utility of EI and PI. During the inter- 
view itself, as already indicated, the Benton Visual Retention Test was 
split into two equivalent parts, one of which was administered immediately 
before and the other immediately after the series of Pictures testing for EI. 
The order of presenting the two halves was alternated from S to S (N = 37). 
Would the arousal of EI by the pictures carry over to the Benton task? In 
fact, slightly more of those with EI than of those without EI improved after 
the first picture series (41 E and Most E vs. Some E and No E), but that 
difference is not significant (р < -10, one-tail test). With six eidetikers, the 
blank screen was returned during the second part of the Benton test, and Ss 
were instructed to retain the EI thereon while picking out (from among the 
four drawings) "the one you have just seen.” This procedure led to no signifi- 
cant improvement. 

Rough empirical tests on a few $$ also failed to substantiate claims con- 
cerning the utility of the EI. Two Ss who said they were guided by EI in free- 
style drawing were asked to reproduce the pictures giving rise to the EI they 
were experiencing. Both drawings were miserable failures; nevertheless the 
Ss insisted they were being guided by PI. Another § whose PI was remarkably 
detailed and accurate claimed that she could "see" in her PI whole pages from 
books. When tested by being asked to recall (а) pages she had originally read 
in school weeks before and (b) a new page that she had viewed for nine 
minutes in the presence of the investigator, she committed gross errors and 
showed evidence of employing ordinary memory (e.g. she substituted syn- 
onyms for words in the original text of the new Page, and she paraphrased or 
omitted unimportant words). On the other hand, when six Ss were asked to 
reinstate some scene from the past (such as a main street in Nairobi), they 


claimed to be able to do so and, when questioned, immediately began reporting 
details with a high degree of accuracy and in the present tense. 


F. Discussion 


B . m H B э, ? 
the usual criteria—vividness, clarity, duration, positive afterimages, projection, 
Scanning, the use of the present tense in reporting details—have been found 
both relevant and satisfactory. 


LEONARD W. DOOB 17 


The much higher incidence of PI among these Kamba, however, raises 
the challenging question of the relation of such imagery to EI and MI. As 
indicated, PI differed from EI with respect to subjective location and reported 
duration; but they were similar with respect to the tense in which they were 
described and also in the vividness ascribed to them. Without exception, all 
Ss claiming PI maintained they were seeing “the picture in my head” then 
and now; when questioned, they believed that having a PI of a picture was 
different from remembering it (N = 14). Thus some Ss were shown succes- 
sively two montages not in the regular series: each displayed eight miscella- 
neous objects, some of which were similar but not identical (e.g., in one 
picture the color of a sock was orange and red; in the other, white) and 
each was distinctively labelled by referring to one drawing in the center (an 
ostrich and a giraffe: two familiar animals). Опе 8 who was interviewed in 
English could unerringly indicate, during the period of short-time recall, the 
Picture in which a designated object had been displayed by consulting, he 
said, PI both of which he simultaneously brought back when instructed to 


do so; but one detail was missing: 


What about the spade, can you see the spade? 

I didn’t see the spade. 

You haven't seen the spade? 

Not yet. 

Well, how does it happen that in bo 
giraffe picture you have seen everything except the spade? 


Yes. 
Why was that, do you think, why, why didn’t you notice the spade, why 


th the ostrich picture and the 


can’t you see it now? 


Because I didn’t see it. . . 
You didn't see it—well, then, you see in the pictures in your head only 


what you have seen— 
No, I see them now. 
You see them now, you see them now, 


I didn't see it. 
The tape must be heard to appreciate the vehemence with which this young 
man defended, in the italicized sentence, the subjective reality of his PI. 

No hard-and-fast conclusion can be drawn about the PI, which clearly 
have elements in common with both EI and MI. Even Jaensch (7, p. 16) 
has said that generally EI "merely have the character of Pictures, which, 
nevertheless, are seen in the literal optical sense." The possibility that PI 
are nothing more than MI cannot be excluded. The Kamba Ss belong to 
a different culture; hence may have been encoding their internal responses 
in a manner that misleadingly suggested resemblances to EI. Tentatively, the 


but you don’t see the spade. 
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author thinks of PI as a form of MI that retains almost all the usual attributes 
of EI or as a form of EI that is coming to resemble MI. Such a guess springs 
from the promising but unproved assumption that EIs are modally present 
in traditional Kamba society and from the fact that MIs are the modal images 
in Western society; consequently, at this historical moment the Kamba may 
be in a transitional stage. Even the completely illiterate Ss in the sample who 
had never visited Nairobi nevertheless had frequent contacts with many aspects 
of Western civilization. They constantly saw automobiles, they heard loud- 
speakers blare in their markets, they were influenced by Roman Catholic 
and Salvation Army missionaries active in their area, they received orders 
and advice from government officials who exemplified some Western customs, 
a few had radio sets at home, etc. 

Careful questioning and various types of analyses have revealed that the 
accuracy of recall was by and large unrelated to imagery. This finding is 
surprising. Ss in published research (e.g., 2, p. 424; 9, p. 192), as well as 
many Ss in the present study, report that EI or PI help them store informa- 
tion. Indeed, again and again other investigators have described Ss who are 
able to use their images to read materials (such as words from a foreign 
language) which presumably they could not have memor 


ized while perceiving 
the original stimulus (1, p. 109) 


; yet a definitive summary has suggested : 


It happens only exceptionally that all the details of the stimulus object 
are correctly reported in the EI. There seems to be a continuum from a 
“blurred shadow” to almost photographic fidelity (8, p. 658). 


All that can be said 


; in commenting upon these conflicting findings, is that 
Kamba Ss (when fai 


ling to recall certain details either from EI or PI) 
without exception replied in one of two ways. At some time during the testing 
all expressed themselves in the present tense: “I see the numbers but I don't 
know what they are,” “I do not see it now,” “It is not clear,” “If I 
remember something, I see it.” Or else they would say in the past tense that 


‹ Rs i Е Ag 5 
к" forgot to see” certain items during the original exposure. Typical is the 
following inquiry concerning a PI: 


Do you see this picture in your head? 
Yes. 


Is it clear? 

Yes. 

What can you see? 

I see three cars and people. 

Can you see the license numbers on the cars? 


LEONARD W. DOOB 19 


I see them, but I did not observe them. 

Can you observe them now? 

Yes. 

Can you tell us what the numbers are? Look at the picture in your head 
carefully and tell us what the numbers are, please. 

I did not look carefully at the numbers. 

Can you look and see them now? 

No. 

Another $ complained, when he could not report the letters and numbers 
from a montage he had just perceived: “I wanted to put them in my mind, 
but then you took the picture away so quickly.” The $ above who could not 
see the spade in his PI of the “ostrich” and “giraffe” pictures was then 
shown the original pictures a second time and the spade was pointed out 
to him; thereafter, he indicated and demonstrated that both of his PI 
contained that object. In the present study there were no instances, as reported 
by other investigators (8, p. 648), of the gradual appearance in EI of details 
which at the outset Ss claimed to be unable to “see.” 

In brief, the contents of EI or PI seemed to include only information that 
had been perceived as carefully as any subsequently stored through some 
other type of encoding. In addition, that storage resembled other forms of 
it has been shown that what emerged in long-term 


memory. By and large, | 
ate report and of short-time recall. $$ 


recall was repetition of the immedi 
with such images tended to group numbers and frequently reported letters 


not in their original order. Nobody could “see” very efficiently either groups 
of letters or numbers, especially when instructed to report them backwards. 
Fragmentary evidence indicating the later recall of information not originally 


reported or recalled and a shift in imagery over time, however, suggests some 


of the complications that may eventually need to be analyzed in studying 


retention. 
The statistically significant tendencies during immediate recall for Ss 


with OI to provide more information per inquiry than those with EI or PI 
and for Ss without EI to provide more information spontaneously than those 
with EI suggests the possibility that EI is a less efficient way of storing or 
coding information, Thus, some Ss with OI gave long and detailed reports 
concerning what they had seen; whereas generally Ss with EI or PI required 
a series of questions to elicit the same amount of information. It looks as 
though the ability to verbalize may have been playing an important role, and 
Such an ability is likely to be positively correlated with schooling which in 
turn has been found to be negatively (if not significantly) correlated with 
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the incidence of EI. At any rate, perhaps Ss with OI had already coded their 
impressions in a readily reproducible, coherent form. In contrast Ss with 
EI or PI could continue to “see” details in the images facing them or inside 
their heads while for the first time, in whole or in part, they transformed those 
images into words in order to respond to the investigator’s questions. 

In addition, the images of the Kamba may not have been more efficacious 
since they were being employed to perform an unusual task. Were this so, 
then accuracy ought to have been greater in recalling details from the more 
familiar African photographs than from the Western-style drawings of Haber 
and Haber; but no differences emerged. If before the coming of the British 
and other Europeans Kamba imagery functioned as it does now, then hypothe- 
ses suggesting that EI help nonliterate people store essential information about 
themselves and their society must be tentatively rejected. The present contacts 
of the Kamba with Western society, however, may be the cause of a decline 
in the use of imagery to mediate recall. Or else the Kamba, both formerly 
and now, may be like Western Ss. Els are strong among children and not 
among the adolescents and adults composing the sample that was examined, 

That Ss with EI could not be differentiated from those with OI ona 


the largely negative findings reported here suggest the need for further in- 


nd ecological and social factors 
- The almost significant differences 


€ - A » 
or meaning" or they seem more “reap” or "compelling?" whatever it is that 
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as paintings or print) and thereby have enabled them to preserve their impres- 
sions from the past not more accurately but more vividly; and they still may 
be serving that function. Indeed, the author has the undocumented impression 
that Ss with EI or PI answered questions about the pictures they had viewed 
with greater certainty than those lacking them. As would follow from their 
ethnopsychology, people who believe in the efficacy of the images are likely 
to feel confident while experiencing them. Such hypotheses must be cautiously 
advanced, however, in view of the fact that in this study ¢ype of imagery 
Was not associated with expressed preference for pictures. Of the 14 Ss who 
liked an African scene best of all, five reported it had produced an EI; two, 
a PI; and seven, OI. 

Possibly the most important problem to be investigated in future research 
is the isolation and identification of the factors in the milieu and in the 
individual’s own life experience that reinforce or extinguish this ability to have 
or to believe one has EI or PI. Even if the negative correlation between the 
Possession of EI and contact with Western culture that was noted clearly in 
the Ibo study and less clearly among the Kamba were confirmed, it would still 
be necessary to locate the precise experiences that encourage the images in 
traditional African societies and not in Western societies. For those Africans 
who become acculturated during their own lifetime and yet continue to 
report EI or PI, the conditions in early childhood that make the ability so 
strong that it can withstand in the future the forces that ordinarily destroy 
them must be established. Do those conditions involve the age at which contact 
with the West (e.g., learning to read or to encode experience in new ways) 
has taken place? Are EI and PI replaced by MI when individuals are less 
dependent upon and feel themselves better able to cope with the external 
world? 

The process of learning and unlearning the proclivity toward EI or PI 
may very well differ from the usual kind of social learning in which parents 
Ог peers deliberately or unconsciously employ a system of direct rewards and 
Punishments to affect or control observable behavior. In contrast, images 
are obviously private, nonobservable phenomena that can be only indirectly 
affected through external events. A growing boy can be shown how to hold 
his hand in order to discharge a weapon, but he can scarcely be guen (even 
if someone wished to do so) comparable tutelage concerning the pictures in 
his head. On a purely anthropological level, it will be important to investigate 
Whether the existence or incidence of EI and PI is correlated positively with 
some customs and beliefs (e.g., ancestor worship) or negatively with others 


(e.g, a System of mnemonic devices for recording events). 
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G. SUMMARY 


On the basis of conventional and rigorous criteria, 20 per cent of a sample 
of Kamba adolescents and adults in Kenya (N = 49) were found to possess 
eidetic imagery. That proportion, also emerging in another African society 
(the Ibo of Nigeria), is many times higher than the incidence in the West. 
Almost all the Kamba so tested claimed to have “pictures” in their heads of 
past events: a kind of imagery shown here to be both different from but 
related to eidetic and memory images. Neither eidetic nor Pictorial imagery 
aided immediate, short-time, or long-time recall. If anything, the absence 
of these images was associated with better recall right after seeing the 
stimulus pictures. The presence of eidetic imagery was not associated with 
performance on a number of psychological tests nor with most census-type 
information, although there was a slight tendency for it to be negatively 
correlated with knowledge of English and formal schooling. Most 8s claimed 
that pictorial images are helpful to them in their daily lives: for example, by 
enabling them to recall the dead. The tentative conclusion reached is that 
eidetic and pictorial images may enable people to recall the past not more 
accurately but more vividly and confidently. 
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CREATIVITY, BRAINSTORMING, AND ORNERINESS: 
A CROSS-CULTURAL STUDY* 


Department of Psychology, San Francisco State College 


Henry Cray LINDGREN AND Freprica LINDGREN 


A. INTRODUCTION 


In recent years the topic of creativity has been attracting an increasing 
amount of attention on the part of leaders in the fields of government, in- 
dustry, and education. There seems to be a widespread feeling that yesterday’s 
methods are not applicable to today’s problems, that new imaginative solu- 
tions are needed—solutions characterized by ingenuity and, of course, by cre- 
ativity. This urge to seek creative solutions appears to be especially acute 
among those who work with and in the developing countries. 

An important question is, “can creativity be developed, stimulated, or other- 
wise encouraged and facilitated or are we dependent on those genuinely 
creative individuals, who apparently are quite rare and unable to stretch 
their resources to meet the demand for creative problem solving?” 

One of the more prominent proponents of the idea that creativity can 
be stimulated is Alex F. Osborn (5), who in 1939 introduced a technique 
he called “brainstorming,” an approach to problem solving that makes use 
of uninhibited group discussion. With brainstorming, all ideas put forth by a 
group member are accepted without question, and no ideas can be criticized; in 
fact, members are urged to build on one another’s ideas. Osborn maintains 
that brainstorming has a stimulating and facilitating effect and that people can 
uch groups than they can working alone. 

Although at least two research studies (2, 6) have shown that individuals 
brainstorming alone are more creative than when brainstorming in a group, 
data from one investigation (2) imply that the creativity expressed when 
brainstorming alone can be increased if preceded by group brainstorming. 
This finding was confirmed by a study (4) conducted by the present authors, 


With groups of students at San Francisco State College. The authors’ study 
hich subjects were asked to write 


held under brainstorming 
ritten during the 


Produce twice as many ideas in s 


captions for cartoons alone; then, in a group session 


rules; finally, alone. The level of creativity of the captions W. 
——— 

* Received in the Editorial Office on May 
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3, 1965, and given prior publication in 
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third phase (after the brainstorming) was judged to be higher than that of 
those written during the first phase. The number of captions produced during 
the third phase was also higher, although the increase was statistically 
significant only for women. 

In the foregoing study, the authors used a forced-choice questionnaire 
(the Asymmetrical Preference Test or APT) based on two sets of self- 
descriptive adjectives: one set chosen by persons preferring asymmetrical 


East cultures, who ordinarily function in non-English languages and (b) to 


ї regarding attitudes characteristic 
of more-creative American subjects (and measured by the APT) characterize 


Middle East Arab countries (Lebanon, 


Syria, Kuwait, Ira { bia, 
Jordan, and Sudan), with a few from taq, Saudi Arabi 


Turkey, Cyprus, Iran, and Pakistan. 
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All class members present on the day of the experiment were permitted to 
Participate, but the response sheets of all those listing English as their first 
language were eliminated from further consideration. The statistics reported 
are based solely on the responses of subjects who reported some language other 
than English as their primary tongue. Arabic was the language more commonly 
reported as a first language; Armenian was second. 

In Phase I (individual brainstorming) of the experiment, subjects were 
given a free-form cartoon and were asked to write as many humorous or 
interesting captions (or “whatever comes to mind”) as they could in a six- 
minute period. In Phase II (group brainstorming), subjects were organized 
into groups of four or five, given a second cartoon, and told to follow the 
Same instructions as for Phase I, except that they were to function for 10 
minutes as a discussion group operating under brainstorming conditions. In 
Phase III (individual brainstorming) subjects were given a third cartoon 
and told to write captions, following the same instructions and observing the 
Same time limit as in Phase I of the experiment. 

After the completion of Phase III, subjects were asked to complete the 
APT. They were given Arabic translations for some of the more complicated 
terms, and further definition was provided orally on request. In spite of the 
fact that the questionnaire was not in the native language of the respondents, 
it proved to be almost as reliable as it was when it was used with San 

Tancisco subjects in the original study (.86 vs. .89). 

Three judges operating independently rated each of the response protocols 
for Phases I and III for overall creativity. Judges were instructed to base 
their ratings on the extent to which protocols could be characterized as being 
Original, appropriate, humorous, or interesting. Protocols were first sorted 
randomly into batches of 16, and each batch was rated on a seven-point 
Scale in such a way that judgments were forced into an approximation of the 
Normal curve, Опе response sheet in each group of 16 was assigned the rating 
ӘР] (Ты. creative), and one was assigned a rating of “7 (most creative) ; 
two sheets were assigned the rating of “2,” and two were assigned a rating of 
“6”; and so on. When each judge had completed his rating of all protocols, 
the Protocols were randomly sorted before new batches of 16 were prepared 
for the next judge. This resorting of protocols probably reduced the inter- 
Judge reliability because each protocol was compared with up to 15 new 
Protocols each time it was judged. The resorting, however, did help SOMBER 
Sate for the fact that a mediocre protocol might be overrated because it 

аррепед to appear in a generally uncreative batch or might be underrated 
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because it appeared in a generally creative batch. In spite of the deliberate 
introduction of such random elements, the interjudge reliability was high. 
Kendall’s coefficient of concordance is a highly significant .76. Judges’ ratings 
were summed to give each subject two level-of-creativity scores: one for his 
Phase I protocol; one for his Phase III protocol. 


C. RESULTS 


'Table 1 presents results that in the main are consistent with those obtained 
with the San Francisco sample on which the original study was based. The 
experimental group made significant gains both in the number of responses 
and in the level of creativity, whereas the control group did not. Indeed, 


TABLE 1 
DIFFERENCES IN THE NUMBER Or RESPONsEs, LEVEL-OF-CREATIVITY SCORES, AND 
ASYMMETRICAL PREFERENCE Test (APT) Scores 


Phase I Phase II 
Variable N Mean SD Mean SD t 

Number of responses 

Men 77 4.36 241 5.07 2.41 2.84* 

Women 28 4.61 2.05 6.17 2.19 3,31* 

Control 29 5.12 2.18 5.22 2.56 .26 
Creativity level 

Men 77 11.03 3.41 13.07 3.77 3.550% 

Women 28 11.96 2.90 14.04 2.98 2,979 

Control 29 13.88 3.76 13.22 3.29 81 
Asymmetrical Preference 

Test 

Men 77 9.82 4.72 

Women 28 10.00 4.14 19 

* 5 «.01. 
** p< .001. 


the method may be considered to have been even more successful with the 
Middle East subjects, inasmuch as gains in number of responses were achieved 
by both sexes in the experimental group; whereas the gain in this variable 
had been significant only for the women in the American group. Incidentally, 
subjects in the San Francisco study produced approximately 75 per cent 
more responses in each phase of the experiment than did the Middle East 
subjects; a difference that is understandable when one considers that the 
Americans were operating in their primary language. 

Table 2 shows that the significant and Positive relationship between APT 
scores and level of creativity found with the San Francisco subjects does not 


hold up well in the Middle East. Positive correlations of any appreciable 


bo 
N 
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TABLE 2 
CORRELATIONS OF ASYMMETRICAL PREFERENCE TEST (APT) SCORES WITH 
NUMBER OF RESPONSES AND LEVEL-OF-CREATIVITY SCORES 


Variable Men Women 
Number of responses 
Phase I 12 .26 
Phase III 13 .38* 
Creativity level 
Phase I .03 42% 
Phase III 13 .22 
525077 
* $«.02. 


magnitude appear only in the results for women, and two of these fall below 


the .05 level of significance. 
D. Discussion 


The positive results obtained as a result of group brainstorming imply that 
the stimulation of individual creative thinking by such methods may be 
Possible even when participants have varied cultural backgrounds. It may 
be that some general principles of group learning work to produce this effect: 
for example, it is possible that the group-brainstorming session gave partic- 
ipants an opportunity to learn how their assigned task might be performed 
more effectively. It may also be that group brainstorming boosted the 
interest level of the task. The fact that the level of creativity of the control 
Stroup declined slightly between Phases 1 and III may be due to flagging 
interest, In other words, whatever the control group gained from practice was 
cancelled by motivational loss. | 

Although the findings of the present study may be considered encouraging 
for proponents of group brainstorming, particularly for those who might be 
Considering its use in “developing countries,” several limitations should be 
Noted. The fact that group brainstorming facilitated creative thinking in 
English does not necessarily mean that it would have a similar „effect in other 
languages, Although a relaxed, informal atmosphere is fundamental to 
Successful group brainstorming, it may be difficult to create such an atmos- 


Phere—at least among people who are not already on an intimate basis 


Socially—especially if communication takes place in the Mediterranean 
languages (including Arabic), because those languages place great stress 
on the formal, syntactical aspects of communication. Some Mediterranean 
languages do not have any words that convey the sense and feeling of the 
English word “informal.” Whereas English-speaking people (especially 
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Americans and Canadians) cultivate an informal way of speaking to put 
others at ease, informality with strangers is not considered to be a desirable 
thing by Mediterranean peoples, and their languages are not structured in 
ways that encourage such a relationship. It may be, therefore, that group 
brainstorming proceeds more effectively in English, even with those for 
whom English is a second or a third language. 

Languages in which the formal aspect rather than the semantic aspect 
is stressed may create a second kind of problem. A preoccupation with finding 
the exact or most elegant turn of expression encourages what Guilford (3) 
terms “convergent thinking,” whereas the success of brainstorming depends 
on the ability to speculate and ramble over a vast range of ideas—i.e., an 
ability that Guilford calls "divergent thinking." Brainstorming experiments 
in Mediterranean languages should be undertaken to determine whether 
or not languages that emphasize formality and structure interfere with the 
development of divergent forms of creative thinking. 

Although the present research presents positive findings regarding the 
value of group brainstorming to facilitate that type of individual brainstorming 
concerned with verbal problems (like the writing of cartoon captions), it 
leaves unanswered the question of whether or not creative solutions of 
mechanical and technical problems or problems involving broad social implica- 
tions can be facilitated by group brainstorming. 

The present research raises some question about the generality of the 
relationship between the more-or-less asocial self-concepts of persons preferring 
asymmetrical art (as measured by the APT) and creative ability. Correlations 
between the APT and level of creativity in the San Francisco study ranged 
from .24 to .36, and all were significant at the .05 level of confidence. The 
failure of similar results to appear in this study may imply that whatever 
attitudes are measured by the APT interact with creativity only in American 
culture. This conclusion is confounded by the fact that correlations between 


various measures of creativity and the APT approach significance for Middle 
East women. 


range of the APT tended 


i t hose scoring high or low. 
The mean gain for those men scoring in the middle third of the APT was 3.77 
on the level-of-creativity scale, as contrasted with 1.23 for the bottom third 


and 1.31 for the top third. The difference between the mean level-of-creativity 
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score for this middle third and that of the top third and bottom third combined 
was significant at the .01 level of confidence. This may mean that a moderate 
degree of the kind of nonconformity the APT measures interacts with group 
brainstorming to facilitate creative thinking among Middle East men—with 
a high degree of conformity or nonconformity being less conducive to a 
Positive response to this kind of treatment. 

The second finding produced by the additional analysis is that the distribu- 
tion of male scores on the APT was bimodal, with one mode near the bottom 
and another near the mean. This type of distribution did not appear in the 
Scores for women. Such bimodality suggests that the APT scores for a 
number of the male subjects may be understated. Such an interpretation makes 
sense in view of the fact that there is a general tendency for Middle East men 
to inhibit the expression of any self-derogatory attitudes, such as the kinds 
of attitudes (bitter, temperamental, pleasure-seeking, sarcastic, and the like) 
that are scored "plus" on the APT key. Fake-good tendencies exist with 
American subjects, of course, but the cultural bans against a man admitting he 


: has such feelings are considerably stronger in the Middle East. Furthermore, 


being frank, especially being frank to psychologists, is much more of a 
"virtue" in the United States than it is in the Middle East. 

In any event, serious question is raised about the desirability of using the 
APT as a predicter of creativity for Middle East men. Either the relation- 
Ship observed in an American sample does not extend beyond the cultural 
boundary of the United States or a sizable proportion of Middle East male 
University students cannot be depended upon to give frank and self-revealing 
Tesponses on a personality questionnaire of the type used. 


E. SUMMARY 


Middle East university students for whom English is not a primary language 
Participated in a three-phase experiment in which they wrote English cap- 
tions for cartoons first alone (individual brainstorming), then in a small 
group (group brainstorming), and finally alone (individual brainstorming). 
Both men and women displayed significantly more creativity in the third 
Phase, as compared with the first phase, using number of responses and 
Tatings of level of creativity as measures of creativity. A control group did 
not show any such gains. These findings generally are consistent with those 
ОЁ a similar study conducted with college students in the United States 
and imply that group brainstorming may function as a facilitator of creativity 
in cultures other than American, although research should be done to determine 
whether or not brainstorming would succeed in languages other than English, 
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Measures of creativity failed to correlate significantly with scores made by 
men on a self-report, forced-choice questionnaire scored in terms of choices 
made by persons preferring asymmetrical to symmetrical art; although correla- 
tions approached significance for women. Significant and positive correlations 


between scores on this questionnaire and level of creativity had been found 
for both sexes in the United States. 
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QUANTITATIVE ASPECTS OF THE GROUP-PSYCHO- 
THERAPIST’S ROLE BEHAVIOR: 
A METHODOLOGICAL МОТЕ* 1 


Research Center for Group Dynamics, University of Michigan 


Jerome Rasow 


A. INTRODUCTION 


One crucial requirement for a critical evaluation of any treatment program 
is the description and measurement of the process of treatment. This descrip- 
tion and measurement becomes especially important when group treatment 
is applied because the focus and manipulation of multiple variables is integral 
to any dynamically oriented group approach. The complexity of such manipula- 
tion suggests the importance of detailed and systematic observation ; yet most 
investigations of group-treatment approaches occur on clinical and descriptive 
levels (2), Also, these descriptions tend to be generalized rather than specific 
(6, 7, 8). Any attempts, therefore, to observe in a systematic, parsimonious, 
and reliable way the extent to which the group therapist conforms to 
Principles of treatment will be valuable for comparing the extent to which 
Eroup therapists differ and concur in their treatment practices. This note 


represents such an attempt. 

This Paper examines some quantitative aspects of role behavior of a 
Stoup psychotherapist who operates according to the principles of Guided 
Group Interaction. Elsewhere, the treatment system (4) and principles of 
this Particularized form of group psychotherapy have been stated in detail 
(1,3, 9, 10). The focus of the present paper is to suggest that quantitative 
Parameters of therapist's behavior may indicate the extent to which practice 
Proceeds according to particular principles of treatment? It may signify 
the importance of the quantitative aspects of the group psychotherapist's role 


behavior аза variable for detailed study. 
ls ce 
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B. QUANTITATIVE ASPECTS ОЕ ROLE PERFORMANCE 


One of the specific principles of Guided Group Interaction revolves around 
the activity of the group leader. Since the theoretical assumption for causation 
involves the view that most delinquent behavior is purposive and adaptive, the 
therapist (rather than waiting for the offender who is committed to his own 
way of life to evaluate his own motives and values) should actively involve 
himself in posing alternatives and in challenging the neutralizations and 
rationalizations used by offenders (5, 11). These efforts at posing alternatives 
have to be interwoven with efforts to create an atmosphere in which boys feel 
free to discuss their behavior. Since the therapist also attempts to steer the 
greater therapeutic potential, his 
velops and begins to discuss and 
has guided its members. A first 
tes at a more active rate in the 
also, that the total number of 
is greater than the number of 
en, of therapist behavior studied 
er of statements per session. 


e challenges or interventions are 
uent boy involved has a purposive 
a elinquent is a boy who has rarely 
if he has, like other delinquents 
ationalizations that successfully justify 
jection by representatives of conventional 
entional norms the delinquent may have 
e challenges аге: “Why did you do that?” 
ay you thought you wouldn’t be caught?” 


"Joe really told us how he felt tonigh 
to make progress with Pete's problem ?” 
Statements tend to be brief and compa 


Ina later stage of group development, the therapist tries to make members 
aware of discrepancies between behavior and stereotypes and aids in articu- 
lating unconscious motivation. Also, the group leader can Provide terminology 
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and frames of reference for classifying individual behavior. Since the thera- 
Pist’s role changes from an active directive, challenging, and supportive role 
to a less active and less directive interpretative role, the statements he makes 
in the early sessions tend to be shorter and more compact than statements 
made in the later stages of group development. This aspect of the therapist’s 
behavior was also evaluated and is the third focus of this study. 


C. METHODOLOGY 


The study described here is based upon a 10 per cent sample of all recorded 
Statements made by the therapist. The therapist whose behavior was examined 
Worked with a group of boys who met for a total of 76 sessions. These 
Sessions were held over a period of four months, and the number of meetings 
used in the analysis numbered 66. Six meetings were not recorded due to 
mechanical difficulties with the recording equipment, and four meetings were 
not used because the group leader, whose behavior is the focus of the study, 
Was not in attendance. 

"Through the use of a table of random numbers a random start was made in 
each of the 66 meetings, and every lOth statement that the therapist made 
Was transcribed and coded. 

A statement was defined operationally as a remark made by the therapist 
that communicated some idea or thought or asked a question. 

Remarks that communicated nothing beyond the fact that the therapist 
heard the speaker's comments were not included in the sample. Remarks by 
the therapist indicating that he had heard the speaker and wished him to 
Continue or to repeat his comments were also excluded. Examples of these 
remarks are; "Yeah," “Uh-hum,” or "Will you repeat that?" If a statement 
Was inaudible because of recording difficulties or interrupted by another group 
member, it was also omitted from the sample. Of the 41 statements that were 
Not utilized because of the foregoing criteria, 29 were interrupted statements, 
nine were inaudible, and two were statements that were repeated. In each 
of the 41 cases, the next remark by the therapist was selected for analysis. 
All of the uncounted, interrupted, inaudible, and repeated statements oc- 
Curred within the first 33 meetings. This fact suggests that after а certain 
Stage of development group members tended to allow the therapist to com- 
Plete his remarks rather than interceding in attempts to deflect intervention. 
Communication between therapist and group members seemed to improve 
with the group’s development, as members tended to interrupt less frequently. 
The 10 per cent sample of statements for the 76 meetings comprised 1074 


Statements, 
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Sampling accuracy was diminished by three factors. (a) All of the thera- 
Pist’s statements were not recorded. Aíter one hour a tape had to be turned 
over; so about 30 seconds of meeting time went unrecorded. (5) Ten meet- 
ings were not used in the analysis. (c) Occasionally, the boys would start the 
meetings without waiting for the observer who was responsible for starting 
the tape. These types of errors are small, they occurred on a random basis, 
and were relatively infrequent. 


D. RESULTS 


ssions 


recorded se: 


100 


Number of Statements 
for each of cleven 


23-33 34-44 45-55 56.66 
Meeting Numbers 
FIGURE 1 


NUMBER oF STATEMENTS MADE BY THERAPIST 
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Spearman rank-difference correlation (using meeting number with correction 
for ties) is —.74. Kendall's tau is —.53. 'Thus, the longer the therapist 
meets with his group the less active he becomes. 'This result indicates con- 
formity to the role prescriptions of Guided Group Interaction. 


2. Rates of Interaction 


To determine the rate at which the therapist interacts with group members, 
the amount of footage between every 10th statement was computed. 

The average amount of footage between randomly selected statements for 
the 66 meetings was 54.1. In interaction rates, the therapist made a 10th 
statement every 268 seconds or every half minute. 

When the average rates of interaction in each of the 66 meetings are 
compared with the average rate of interaction for all 66 meetings, one dis- 
covers 17 meetings for which the rates of interaction fall within two standard 
deviations of the average interaction rate for all 66 meetings. Table 1 shows 

TABLE 1 


Tue NUMBER OF SIGNIFICANTLY НіснЗ AND 
SIGNIFICANTLY Low INTERACTION RATES 


Meetings 
Interaction rate 1-33 34-66 Total 
Significantly high 19 2 21 
Significantly low 2 26 28 
21 28 49 


Total 


cantly higher in the first half of the group 


sessions as compared with that in second half of the group sessions. The chi 
square of 33.9 is significant beyond the .001 level. The data demonstrate 
that the therapist takes a more active role initially in the group meetings and 


decreases his participation as the group develops. 


that therapist interaction is signifi 


3. The Number of Words in a Statement 

er of words for each statement is 10.5. This indicates 
ke statements that are short and compact. These 
s or explanations, but statements that ques- 
tion or speculate about an individual in the group. The therapist's statements 
might be seen as stimuli that other group members might pick up and develop 
and utilize for purposes of making their own interpretations or to test their 


own reality perceptions of the situation. 


nse of the number of words falling above 


The average numb 
that the therapist tends to ma 
statements are not interpretation 


3 Significant is used in the statistical se 
and below two sigmas from the mean. 
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The reader may recall that the role prescription for the group leader in 
the initial stages of Guided Group Interaction prescribes that the therapist 
challenge the assertions and rationalizations of the group members. Thus, the 
group leader, if he is fulfilling the role prescriptions, tends to utilize other 
group members for the purpose of making interpretations. As the group 
develops into an effective treatment unit, the therapist can make interpreta- 
tions like other group members. This latter type of statement tends to be 
longer than remarks designed to challenge group members. As suggested 
earlier, it might be expected that the length of statements increases as the 
meeting develops. 

Table 2 indicates the meetings in which the therapist made statements 


that are significantly longer and shorter than the overall average of 10.5 
words per statement. 


TABLE 2 
THE NUMBER OF SIGNIFICANTLY LONG AND 
SIGNIFICANTLY SHORT STATEMENTS 


Meetings 
Statements 1-33 34-66 Total 
Significantly long 8 20 28 
Significantly short 19 5 24 
Total 27 25 52 


The chi square for Table 2 is 12.1, significant beyond the -001 level; thus, 
there is a significant association between the length of the therapist's state- 
ments and the length of time that the group has met. 


E. Summary AND CONCLUSION 


In assessing the role prescriptions of a group-therapist activity, 


Е the number 
of statements per meeting, 


the rates of interaction per session, and the length 
of statements may be considered important parameters of behavior, A 


forward counting process was used to study and evaluate 
dimensions of the behavior of one therapist. Role descriptions fo 
in Guided Group Interaction were described and evi 
role requirements was presented. 

The attempt to measure the role performance of 
is guided by particular treatment principles shows t 
role prescriptions were followed. Both the number 
rates of interaction decreased significantly over time. 
of statements increased significantly during the later 

In the group observed, the therapist adhered to 


straight- 
the foregoing 
r the therapist 
dence for adherence to 


a group therapist who 
hat all the quantitative 
of statements and the 
Furthermore, the length 
therapy sessions, 

the limited quantitative 


P 


ap 


JEROME RABOW 37 


treatment principles specified by the theory of treatment. This note suggests 
that the attention to quantitative parameters may be one method by which 
systematic differences between therapeutic schools may be discovered. 


11. * 
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A DELAY-OF-REINFORCEMENT EFFECT 
IN PERSUASIVE COMMUNICATION ?* 


Communication Research Division, Boston University 


RarpH L. Rosnow? 


A. PROBLEM 


In a recent study by Rosnow and Russell (3), opposing persuasive com- 
munications were read to high-school students either just prior to the onset 
of a “reinforcing” event or immediately following that event. When com- 
munications preceded reinforcement, findings supported an hypothesis de- 
rived from the earlier Law of Effect; i.e. respondents’ opinions were found 
to have been significantly changed in the direction advocated by the one com- 
munication (either pro or con) that was closer in time to the reinforcement. 

d-of-effect hypothesis, had been posited 


A similar association, based on a sprea 
for a case when communications followed reinforcement. These results were 


not statistically significant ( > 05), and this second hypothesis was rejected. 
In that study as well as in one by Rosnow and Lana (2), “reward” was 
characterized as the relief that ensued after release from a punishing situation. 
Findings lent support to the contention that a tendency to respond in the 
directions advocated by the communications would be strengthened for the 
one communication preceding and closer in time to the rewarding event. 
When communications followed that rewarding event, it has been suggested 
(3) that since communications are not instantaneous stimuli, a temporal lag 
must exist between the onset of the stimulus complex (the communications) 
and the opinion reaction. When communications preceded reward, this 
reaction led into the rewarding event. However, when communications fol- 
lowed a rewarding event, the delay in opinion reaction time may have far 
exceeded what would have been optimally appropriate to link the communica- 
tion nearer to the rewarding event with that event, due to “decay” in the 
reward gradient. The connection would have been expected, therefore, to 
be weak; and a nonsignificant order effect would appear plausible. 
orial Office, Provincetown, Massachusetts, on March 25, 1964. 
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The present paper is a report of an experiment designed to explore the 
effect of a “punishing” event on opinion change, when that event was either 
closely preceded in time or closely followed in time by opposing Persuasive 
communications. Following the rationale of the two other studies (2; 3), it 
was now contended that a tendency to respond in the directions advocated 
by the communications would be weakened for the one communication 
closer in time to a punishing event. When both communications preceded 
punishment, it was hypothesized that any opinion change would be in the 
direction of primacy (i.e, the direction advocated by the first of the two 
opposing communications, the one farther in time from the punishing event). 
When both communications followed punishment, this rationale would now 
suggest, if anything, a weak recency effect (i.e., opinion changed 


direction advocated by the last communication, the one farther in tim 
the punishing event). 


in the 
e from 


B. PROCEDURE 


Ss were 84 third-year and fourth. 
chology classes, all of whom had no p 
Pretest opinion questionnaires wer 
scheduled class meeting, 
the University who is 
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Questionnaires were completed, following which E th 
and the students for their cooperation and left the classro 


between pretest and treatment, thus reducing the effect 
naire responses and the 


variables. On the seventh 
(А; and A5) by expressing his d 
homework and announced a sur, 
The quiz consisted of two open- 
class but the content of which wi 
permitted only five minutes to 
collected their papers, the inst 
con communications? 
classes). 


-year high-school students in four psy- 
rior exposure to this type of experiment. 
€ administered by E during a regularly 
at which time he was introduced as someone from 
"polling students opinions on the United States’ use 


anked the instructor 
om. One week passed 
of recall of question- 
tion as complicating 
о of the four classes 


as of an esoteric nature, and the students were 
record their answers. Immediately upon having 
ructor reintroduced E who then read pro and 
to Ss (using reversed orders of presentation for the two 
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In the remaining two classes (As and A,), E read the communications 
immediately prior to announcement of the “punishing”? quiz. Posttest 
opinion questionnaires were completed by all Ss after completion of the 
punishing event and reading of the communications. 

A fifth group of students, using a Likert-type questionnaire, rated the 
communications for direction and strength and rated the topic for its per- 


ceived controversialism. 
C. RESULTS 


'The questionnaires and communications were the same as those used in 
Rosnow and Russell’s study. The pro and con communications were around 
425 words long and took a little over four minutes to present. Pretest and 
posttest opinion questionnaires were identical, each consisting of five five- 


point Likert-type items. The range of scores was five through 25, with five 


indicating the most unfavorable attitude and 25 indicating the most favorable 


attitude toward “use of nuclear weapons." 

For the group of students who rated the communications for direction and 
strength and the topic for its perceived controversialism, results were con- 
sistent with previously reported findings (3). “Use of nuclear weapons” was 
rated as moderately controversial in nature, while the pro and con com- 
munications were rated as opposite in direction; but were not rated as 
significantly different from one another in terms of the perceived strength 
of their content (df = 21, t = 45, p > 05). 

The subtractive procedure that results in 2 tests for differences between 
pretest and posttest opinion scores for the four treatment groups confirms 
the primacy prediction but not the recency prediction. Primacy had been 
predicted when communications preceded a punishing event, and results sup- 
ported this expectation (df = 33, t= 2.1, p < .05). When communications 
followed punishment, the results suggested recency but fell at a nonsignificant 


level (df = 47, t= 3, 5 > 05). 
D. Discussion 


Whereas for Thorndike the spread of effect represented a gradient of 

reward, there has been evidence by Tilton (4) to suggest an analogous spread 
, 9 А = 

of negative effects or gradient of punishment. It was from this latter position 
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that the second of our two hypotheses, the one predicting recency, was 
79А direction was, іп fact, obtained ; but the effect was too weak 
to be taken as evidence for a "spread." This still does not rule out the 
possibility of obtaining such a spread of effect in this kind of Persuasive-com- 
munication situation, but it does suggest the necessity for either some modifica- 
tion in the length of the communications or for more sensitive measures of 
change in the immediate region of the reinforcement gradient. 

On the other hand, hypotheses derived from the earlier Law of Effect 
have been consistently confirmed (both in the case of reward as well а 
punishment). Previously, Rosnow and Russell reported that respondents 
opinions were changed in the direction advocated by the one of a pair of 
opposing arguments more immediately preceding a rewarding event. This 
suggested that contiguity between the second-presented argument and the 
reward strengthened the tendency to respond in the direction advocated by 
that argument. Now we find that opinions are changed in the direction 
advocated by the one of a pair of opposing arguments Preceding (but farther 
in time) from a punishing event. This suggests that contiguity between the 
second-presented argument and the punishment has weakened (or perhaps 
suppressed) any tendency to respond in the direction advocated by that second 


argument; this has permitted the first argument to have the greater influence 
on opinion change. 


Both the present 
a typical delay-of- 


с eve ications followed 
punishment, results (though no ignificant) Suggested a similar 
trend, Findings supported an i i from the earlier Law of 


read-of-effect hypothesis, "The present fi 


ndings, 
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considered in the light of a study reported earlier, suggest a typical delay- 
of-reinforcement gradient. 


3. 


4. 
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THE EFFECT OF “HUMAN” VS. “MACHINE” SET 
ON GROUP PROBLEM-SOLVING PROCEDURES*? 


Educational Testing Service, Princeton, New Jersey 


Joun К. Hempuity AND CaroLYN B. McConvitre 


A. INTRODUCTION 


Much research has been conducted that has attempted to contrast the 
effectiveness of the individual working alone with the effectiveness of groups 
in problem solving (1). The results of much if not all of this and similar 
work may have more meaning as more is understood about the basic character- 
istics of group tasks and how these characteristics are related to problem- 
solving processes. Concepts need to be introduced into research on group 
problem solving that are oriented toward the description of structural task 


characteristics. 

Progress has been made recently toward an analysis of some of the demands 
tasks make upon the interacting relationships among group members. One 
important step was the recognition of the distinctions between disjunctive 
as contrasted with conjunctive task requirements (10). If the relationship 
between the acts of group members and their progress toward goal achieve- 
ment is disjunctive (because of the “parallel” relationship of subtasks), the 
individual’s contribution to task achievement can be made independently of 
the acts of other members of the group. On the other hand, if relationships 
between the acts of individual group members and achievement of the group’s 
goal are conjunctive, then what the individual does may or may not be 
effective, depending completely upon what some other member of the group 


has done or will do in the future. Clearly, group tasks having disjunctive 


requirements and those primarily of the conjunctive variety may call for 


radically different procedures for their solution. 

The experiment to be described is a study of the effects of two contrasting 
“sets” introduced to the subject about the nature of the “others” with whom 
he was working, where the total task involved predominantly conjunctive 
connections between the subtasks. In the case of the “human” set, the subject 


—— 

* Received in the Editorial Office, Provincetown, Massachusetts, on March 26, 1964. 
Copyri ] Press. 

d qe rmm by a contract between the Office of Naval Research 
(Group Psychology Branch) and Educational Testing Service. 


45 


46 JOURNAL OF SOCIAL PSYCHOLOGY 


was led to believe that he was working with another human subject on the 
task. In the case of the “machine” set, the “other” was represented as a 
machine that had been programmed to respond in a complex manner to the 
acts of the subject. In the case of both sets the Tesponse actually was made 
by the “experimenter,” who followed a predetermined structure that was 
identical for the two conditions. 

The concern of the investigation is that subjects under the influence of the 
two sets may make different responses to the objectively identical structure 
they find in response to their acts. 

From a point of view suggested in a theory of leadership and group be- 


havior (3, 4, 8), the individual responds to working with others on mutual 


roduce consistency in his 
own behavior as a means of making the "other's? task of predicting his 


behavior more tractable. Thus, if the individual is working on a mutual task 
with one or more persons (and assuming high motivation), his success would 
depend on (a) the accuracy of his analysis of the task requirements, (5) his 
understanding of the probable behavior of the others, (c) his formulation of 


meet the task requirements, and (4) his com- 
intended behavior, If the individual is set to 
regard the task he faces as “mechanical” and as not involving the behavior of 
s depends only on the accuracy 


of his analysis 
n his formulation of an 


adequate course of 


roblem-solving 

"two" contrasting sets 
i Which the individual 
Which he can learn 
hom he is interacting 


can secure information about 
about consistency in the behavi 
may be very similar, 

The general hypothesis of the 


experiment may 
Individuals attempting to solve 


In other words, the problem solver, when working with other humans, will 
H H H H - , 
modify his behavior to tak е 
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Rejection of this hypothesis cannot be interpreted as suggesting that social 
factors in group problem solving are unimportant, but may lead to specifying 
and examining conditions under which differences in the individual’s approach 
to a group task might be better understood. 


B. PRocEDURES 


Twenty-four men and 24 women, matched only on educational level, served 
as experimental subjects. The men represented all undergraduate classes at 
Princeton University, and most of the women were wives of graduate students 
at the same institution. All had had some college education, but none had 
advanced degrees. No attempt was made to match subjects within pairs. 

The 48 subjects were run in pairs, women subjects separately from the 
men. Instructions were given to 12 pairs of the subjects (six of each sex) at 
the beginning of the experiment in a manner calculated to lead them to 


believe that the two members of the pair would work together as partners on 
The remaining 12 pairs were led to believe that 


the tasks they were to do. 
ould respond to a complex machine. 


they would work as individuals and w 
Each subject worked on two distinct tasks (I and II) that were counter- 
balanced throughout the experiment in ABBA order. 

The two subjects were seated in a room partitioned by a folding door, giving 
privacy to each individual. One-way vision mirrors permitted the experi- 
menters to observe the subjects as they worked. On the table in front of each 
subject was an electrical plugboard, arranged in six rows and five columns of 
one and one-quarter inch squares; each square, letter-number coded (El) for 
easy identification, contained a telephone jack and a neon light. Figure 1 is a 
simple sketch of one of the boards. 

Each subject was supplied with a set of four telephone plugs to be used 
with his board. Each plug was marked with a different color: blue, green, yel- 
low, or white. Two of the plugs were wired internally in a manner such as 
to cause the neon light in a square to come on when it was inserted into the 
Jack in that square. The other two plugs were dummy plugs and did not 
affect the lighting of the squares. - 

Instructions given to the subjects at the beginning of the experiment were 
used as the means of establishing a “human” or a “machine” set toward their 
Partners. In order to make the sets more plausible, it was necessary to include 
the following elements in the experimental situation. (These elements, al- 


though trivial in themselves, led to seemingly important differences in ex- 


perience with the tasks.) 


To create the "human" set, a subject was told that his partner was working 
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at an identical board and that his partner also had two “active” plugs and 
two “markers,” but that his active and marker plugs were of the opposite 
colors—thus Subject A controlled the blue and yellow active plugs; Subject 
B, the green and white ones. Each subject was permitted to move one plug at 
a time, taking alternate moves. 

The “machine” set was created by instructions explaining that the boards 
were connected with an electrical switching mechanism and that the “machine” 
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plugs were to be moved according to specific rules in a manner that one of 
them would light the square in the uppermost right-hand corner of the board. 
The object was to achieve the goal in as few moves as possible. The plugs 
were permitted to be moved only in a specified color order, and the four 
plugs had to remain together as a group, maintained at all times by a hori- 
zontal or vertical connection. 

At no time did the subjects interact with each other, as they believed they 
were doing in the “human” condition. In reality, each subject’s board was 
interconnected to a separate board controlled by one of two experimenters; 
thus there were two independent sets of two boards in series. The experi- 
menters followed a predetermined structure that was identical for the two 
conditions. 

In the instructions to the subje 
ner’s” next move or the "machine's" next move 
condition in which the subject was participating. 
forms were provided for this purpose. 

Each subject was given eight trials per task or 16 trials in the total experi- 
ment. Each trial consisted of 10 moves. If the goal was reached in less than 
10 moves, both experimenters and subjects replaced their plugs in the occupied 
squares for the duration of the trial. For a perfect solution, Task I required 
eight moves; Task II, seven moves. The experimenter’s play was governed 
by rules that defined a pattern of moves that was effective in reaching the goal 
square in the minimum number of moves, providing the subject made the 
appropriate move at each opportunity. А m 

A postexperimental questionnaire was administered to determine attitudinal 
differences on the part of the subjects participating in the two conditions. 
The instrument was patterned after Osgood’s method of semantic differentials 
and ascertained attitudes pertaining to (a) the experiment as a whole, (5) the 
subject's own play during the experiment, and (c) the responses made to his 
play (7). Most of the word pairs reflected common stereotypes of differences 


between machine and human performance. 


ct, he was asked to predict either his “part- 
depending on the experimental 
Comparable mimeographed 


С. RESULTS 


The data provided by the subjects in completing this postexperimental 
questionnaire were examined by means of two 10-variate multivariate analy- 
ses of variance (2).2 One of the two analyses concerned 10 semantic-differen- 
ciation to Professor R. Darrell Bock of the Psychometric 


f North Carolina for suggesting the multivariate analy- 
lysis of the data. 
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tial scales that the subject had used to describe his reaction to the experiment 
in general. The second of the two analyses was of five scales that referred to 
the subject’s views of his acts and five scales that referred to his views of the 
responses he had received to his acts. Two experimental classifications, con- 
dition (human vs. machine) and sex (men vs. women), were considered in 
the analyses. In the first of the two analyses, the following 10 semantic- 
differential pairs were included: 


1. Friendly-Unfriendly 6. Public-Private 

2. Human-Mechanical 7. Sociable-Unsociable 
3. Meaningful-Meaningless 8. Rhythmic-Irregular 
4. Honest-Dishonest 9. Rigid-Flexible 

5. Emotional-Unemotional 10. Orderly-Confused 


The results of the analysis were not encouraging insofar 
of the over-all effect of the "human" vs. “machine” 
This effect is not significantly larger than the estim 
Sex differences may be present in the m 


as the significance 
conditions was concerned. 
ate of experimental error. 
anner in which the subject reacted to 
the experiment (or perhaps in response set to the rating tasks). The signifi- 
cance level for the test (.06) is not small enough to reject the hypothesis of 
no association with sex differences with great confidence, but produces con- 
siderable doubt. The interaction of sex differences and the experimental con- 
ditions does not appear to produce a significant effect, 

Despite the nonsignificant and doubtful outcome of the over-all effects of 
the conditions and sex differences, an examination of the 10 variates indi- 
vidually was undertaken with the hope of clarifying some of the ambiguity. 
Caution must be exercised in interpreting the results, since it has already 
been shown that the over-all analysis of the variates does not reach the .05 level 
of significance. However, the trends in the detailed data may be worth noting 
since they follow what might have been expected. Under the "human" con- 
dition, the experiment appeared to the subjects to be better described as (a) hu- 
man rather than mechanical, (5) more emotional th 
more sociable than unsociable. These seem reason 
differences there are appear to involve the tenden 
regard the experiment as more private than do the 

In the over-all analysis of the second set of 10 
"machine" conditions and sex differences appear t 
the .03 and .02 levels respectively. The interaction 

The variates considered in the second analysis 
this table it becomes apparent that under the 
jects who had been instructed that they 


an unemotional, and (c) 
able contrasts. What sex 
cy of the men subjects to 
women subjects. 

variates, both “human” vs. 
o have significant effects at 
of the two is not significant. 
are listed in Table 1. From 
“human” condition those sub- 
were working with a Partner regarded 
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the responses they received to their acts as less “clear” than did the subjects 
who had been instructed that the responses to their acts were generated by a 
“machine.” This Clear-Confused Scale was the only variate among the 10 that 
was clearly related to the “human” vs. “machine” conditions. Leading-Follow- 
ing, as applied both to the subject’s own acts and the responses to his acts, 
tends to be the next most sensitive pair. There appears to be a slight tendency 
for "leading" to be associated with the "human" condition, as applied both 
to the subject's own acts and to the responses. In other words, the subjects 
TABLE 1 


DIFFERENCES BETWEEN THE EXPERIMENTAL CLASSIFICATIONS 
FOR THE 10 VARIATES OF SET 2 


Source 
Human vs. machine Sex differences 
Variate X, m F Xn X, F 
Subjects’ acts 

Responsive-Unchanging 2.84 3.04 T 3.21 2.67 1.18 
Leading-Following 300 3.80 299 302 371 186 
Emotional-Analytical 534 546 — 1] 612 466 1544* 

2.88 2.70 46 2.66 2.92 37 


Considerate-Inconsiderate 
Cooperative-Competitive 


Responses to subjects’ acts 

Leading: Following 3.29 +01 2.63 3.92 3.42 1.17 

Cooperative-Competitive 3.04 2.71 AT 2.54 3.21 1.88 
Considerate-Inconsiderate 2.92 3.16 35 2.75 3.34 1.93 
Responsive-Unchanging 3.50 3.79 21 412 316 2.30 
Clear-Confused 3.92 246  10.00* 3.21 347 .01 


* Significant beyond the .01 level. 


who worked under the "human" condition viewed their own work as involving 
"leading"; but also apparently felt that their partners were leading" too. 


Perhaps the subjects who worked under the "human" condition were set in 


a manner that permitted them to rationalize their difficulties with the task by 


projecting some of their confusion as а characteristic of their partner. 

Sex differences in response to the experiment appear to be confined to one 
variate, Emotional-Analytical. Men subjects, as compared with women sub- 
jects, viewed their acts as more analytical or were less willing to describe 
their work as emotional. 

D. QUALITY OF PERFORMANCE 


The number of “moves remaining” to be made after the eighth move on 
Task I or after the seventh move on Task II was used for the purpose of 


determining the effectiveness of task performance. It will be recalled that 
eight and seven are the minimum number of moves required to complete 


52 JOURNAL OF SOCIAL PSYCHOLOGY 


these tasks. “Moves remaining” is a measure of distance between the actual 
performance and the best possible performance taken after the minimum 
number of moves had been made that could have completed the task. This 
distance is expressed in terms of the number of moves that would have to be 
made to bring the actual performance to the level of the best possible per- 
formance. In determining “moves remaining,” the position of the square just 
occupied at the completion of the eighth (or seventh 
the number of squares between that square and the 
with proper observation of the restriction th 
to those squares that border a square on its 
This measure reflects inadequate task perfor 
a simple count of perfect trials, 
performance, 


) move was noted, and 
goal square was counted 
at legal moves can be made only 
four sides (city-block distance). 
mance in more detail than does 
in that it differentiates degrees of inadequate 


move made and, because of the 
moves, continued throughout th 
of 24 subjects, performances on 
the first move were separated fr 


was first to move on the trial, 


It 


on those trials of Task II where the experimenter moved first and apparently 


I where they moved first. More detailed exami- 


10 level of significance, 
conditions and task order, "There is, therefore 


m 
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TABLE 2 
PARTITION OF CHI SQUARE ASSOCIATED WITH NuMBER ОЕ Moves REMAINING 
(Task I, Experimenter First, Subject Group A) 


Source df x? MS F 
1. Between conditions 1 3.409 341 2.10 
2. Between sexes 1 2.817 2.82 1.74 
3. Between task orders 1 4.761 4.76 2.94 
4. Interaction (1x 2) 1 2.817 2:92 174 
5. Interaction (1x 3) 1 6.338 6.34 3.91 
6. Interaction (2x 3) t 2.817 2.82 1.74 
7. Interaction (1x 2x 3) t 2.817 2.82 1.74 
8. Between individuals 16 25.915 1.62 

Total 23 51.690 


Task II preceded or followed Task I may have been a factor in making a 
difference between the “human” and “machine” condition. The mean num- 
ber of moves remaining for the four comparisons involved in this interaction 


are as follows: 


Machine Condition Task II Follows Task I 1.83 
Machine Condition Task II Precedes Task I 75 
Human Condition Task II Follows Task І 1.75 

Task II Precedes Task I 1.83 


Human Condition 
These data suggest the hypothesis that individuals working under the 
“machine” condition may learn from experience with a previous task, but that 
under the “human” condition no such learning occurs. This observation ap- 
Pears to be consistent with the attitudinal data in which it was shown that 
the major difference introduced by the “human” vs. “machine” sets was that 
subjects who worked under the “human” condition tended to regard their 
Partners as confused. They might not have expected to learn from the play 
of the others, and thus may not have been as attentive to this play as were 
the subjects under the “machine” set. 
'This detailed discussion of data from the first classification category of 


"Table 3 illustrates the manner in which the F ratios may be interpreted. 
Table 3 :zes the F ratios for each of the eight analyses. A definite 
мони таса be differentially difficult for the 


tendency appears for some of the tasks to | h 
men and women subjects [see Column 2]. In every case the difference was in 
the direction indicating that the men subjects were able to perform the tasks 
more effectively. Task order [see Column 3] appears significant at the .05 level 
for two of the categories of classification of tasks. These two involve Task I 
when the subject moves first. This difference 1s perhaps best regarded as a 


F Ratio SUMMARIZING THE RESULTS OF PARTITION 


TABLE 3 


S ОЕ Cur SQUARE or THE NUMBER OF Moves REMAINING 


FOR EIGHT CATEGORIES OF Tasks, Firsr Moves, AND SUBJECT Groups 
Classification Sources? 

Human vs, Inter- Inter- Inter- Inter- 
machine Sex "Task order action action action action 

Task First move Subjects (1) (2) (3) (1 x 2) (1x3) (2x3) (1х2х3) 
I Experimenter A 2.10 1.74 2.94 1.74 3.91 1.74 1.74 
I Experimenter B 2.59 9.07** 1.77 63 2.59 0+ 142 
I Subject A 01 1043** 4,52% 3.28 31 1.40 1.05 
1 Subject B 17 517% +.66* 17 327 09 03 
п Experimenter A 73 00 2.25 -18 41 2.25 0+ 
II Experimenter B .32 11.07** 32 01 1.90 .01 .53 
п Subject A 10 145 33 .00 .33 10 4.92% 
п Subject B 67 4.42 .31 1.34 .53 1.15 67 


4 All ratios use individual difference chi square as the “ 


* Significant at the .05 level. 
** Significant at the .01 level. 


error” term, 


TS 


A90'IOHOASd TVIOOS 20 1v Nunof 
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transfer of training provided by experience with Task II when it preceded 
Task I. The interesting observation is that such transfer did ло? occur at a 
significant level for the remaining six categories. The F ratio for none of the 
six reached the .10 level of significance. No ready explanation for this phe- 
nomenon seems to be available. 

The most significant finding shown by results summarized in Table 3 is 
that none of the F ratios for condition (Column 1) is significant at the .10 
level of confidence. There is no evidence suggesting that the subjects per- 
formed better under either the “machine” or the "human" sets. Thus, it would 
appear that effectiveness on the task as measured by the criterion of number 
of moves remaining does not depend upon the difference between a set to 
work with another human as a partner or a set to work with a machine as 
a partner. 

E. PREDICTIONS 

A part of the task assigned to each subject was predicting the move about 
to be made by either the "machine" or "partner" with whom he was working. 
This feature of the experiment was designed to yield information about what 
moves the subjects came to expect as the responses to their own acts. Since the 
subject made predictions only on alternate moves (i.e., before the experimenter 
moved), only the first three predictions per trial were completely free of the 
ambiguity that could be produced by the fact that the task had been com- 
pleted in the minimum number of moves. Only these latter data, ora total 
of 48 predictions per subject, were retained for analysis. A comparison was 
made between the number of trials for which perfect predictions and perfect 
performance were achieved by each subject. o. | 

Perfect prediction was ascertained for three criteria of perfection. The first 
(Criterion A) required that each of the three predictions be exactly the square 
that the experimenter selected on the predicted move. The second (Criterion 
B) required only that the predicted square be one that could lead to a perfect 
solution of the problem if the pattern in the ш Б ыы bn not 
considered as a factor. This criterion of accurate pre iction require m 
awareness on the part of the subject of possible "block" in боне ш ut 
did not require that he correctly indicate the exact ue үч ich the d 
menter subsequently moved. The third (Criterion а € e most re = 
in that the subject was required only to observe the rules : etermining i 
moves and to note that the predicted move was a forward move (i.e., wou 


reduce the number of moves remaining). . 
Inspection of the data showed most clearly that many of the subjects were 
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successful in completing the tasks in the minimum number of moves more 
frequently than they were successful in predicting correctly the first three 
moves of their partner. If we consider the data for Criterion A, only two of 
the 48 subjects earned a larger number of perfect predictions than of successful 
goal achievements and, in each of these two cases, the excess was a single 
success in prediction. In contrast, one subject achieved the goal successfully on 
11 trials (62 per cent), but predicted correctly on only two occasions (12 
per cent). And a second succeeded on eight trials (50 per cent), but predicted 
correctly only once (six per cent). A simple sign test (9) shows these dif- 
ferences between the frequency of perfect predic 
achievements to be highly significant (5 « .001). 
The Criterion B data changed the picture somewhat. Twenty-five subjects 
made a larger number of correct predictions than successful goal achievements; 
however, detailed examination of data showed that 42 subjects succeeded in 
reaching the goal one or more times on trials for which they had made incor- 


axed criterion, C, for which 47 of the 


requently than they were successful in 
goal achievement, detailed data showed that 20 of the subjects obtained the 


$ on one or more trials on which they had 
; Le., had not predicted a “forward” move for the 


tion and of perfect goal 


experimental sets of “human” vs. 
ects, analyses 

for Subjects 
ggested previously, 


2: А 
into components associated 


of the “other” for each trial summed over 16 trial: 


5. 
The only significant factor 


related to accuracy of prediction found for the 


— 
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subjects in Group A was the sex of the subject. The 12 women subjects 
achieved an average accuracy of prediction of 25.0 correct, 52 per cent of the 
48 predictions that each made. The 12 men subjects averaged 32.3 accurate 
predictions in 48 opportunities, or approximately 67 per cent. The “human” 
vs. “machine” sets given the subject were not factors affecting the accuracy 
of prediction. 

For the 24 subjects in Group B, the difference between sexes in the fre- 
quency with which correct predictions of the move of the “other” were made 
is again significant (p < .02). The 12 men subjects predicted accurately 64 
per cent of the time as compared with 52 per cent for the women subjects. 


F. PATTERNS OF Астѕ AND RESPONSES 


It is possible for the work of the subjects on the tasks to differ with the 
conditions of the experiment in ways that would not be reflected in the effec- 
tiveness of task performance, e.g. different patterns of responses associated 
with preferences for certain routes or general approaches to the goal rather 
than others. 

In order to make the analysis of patterning effect practical, it was necessary 
to establish general “indicators” of major pattern differences. Three such 
indicators were constructed for Task I and three for Task II. The first step 
in constructing these indices was to plot actual moves made by each subject 


on each trial. The four indices obtained are as follows: 
f moves that were made to squares in Column D. 


2. The number of moves that were made to squares in Row 2. 
3. The number of moves made to squares in Column B (Task I only). 
4. The number of moves made to squares in Column C (Task II only). 


1. The number o 


Counts of frequencies for these indices were limited to the first seven or the 
first eight moves depending upon the task. Chi-square analyses were made on 
data generated by application of the four indices to the records of the work of 
24 subjects in Group A. Although some meaningful differences in approach 
to the tasks were found, the most important finding of the analysis for the 
Present investigation is the absence of significant F ratio for the effects of the 
different sets, There is no reason to assume that these sets caused the subjects 


to approach the tasks in different manners. 


G. Discussion 
Е the effects of a set to work with a 
tner in mutual problem solving have 
"sets" produced meaningful 


е The findings of this investigation © 
machine" partner vs. a “human” par 
been generally negative. It was shown that the 
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differences in the attitudes of the subjects toward the way in which the partner 
was responding to the problem. "Human" partners tended to be regarded as 
more confused in their work than were “machine” partners. This situation 
can be understood best as a projection of the subject’s own confusion, since 
objectively there was little or no difference between the actual responses of 
their partners. "Humans" easily can be regarded as confused, but such a 
characterization of “machines” does not seem appropriate. However, regard- 
less of this attitudinal difference produced by the sets, the actual problem- 
solving behavior of the subject was not significantly affected. 

Measured by (a) accuracy of predicting the response of the partner, (b) 
effectiveness of work on the task, and (c) patterns of approach to the task, a 
null hypothesis of no difference between sets would not have 


been refuted. 
The fact that each of these measures was sufficiently 


sensitive to show sex 
difference, task-order difference, or difference associated with the interaction 


of variables gives added confidence to the view that being set to work with a 
"human" partner, as compared with a "machine" partner, was not a potent 
source of variance in the subjects' problem-solving behavior on these tasks. 

It would not be appropriate to generalize the finding of this study to tasks 
that differ markedly from those used in the experiment. "These tasks were 
deliberately chosen to be simple; no communication other than that contained 
in the signal character of the Partner response was permitted, and subject 
matter or content of the tasks had a definite "game" character. It seems very 
likely that tasks could be found for which significant difference with “human” 
and “machine” sets could be demonstrated. 

The significance of the present result lies in the suggestion that there are 


important task Properties that are common to m 


man-with-machine systems. In current applied psychological work, there is 
much concern with man-ma 


chine systems. Man's long concern with problems 
of the effectiveness of his working relationships with his fellow men is not 
in need of documentation, 


It appears that effective relationships between two 
interacting entities may have significant characteristics that can be studied at 


a most general level. Understanding of the complex problems of social inter- 


actions may be facilitated by first developing a better knowledge of the strictly 
nonsocial elements that social interactions embrace. 


an-with-man systems and 


Н. Summary AND CoxcLusioxs 


Ап experiment was designed to explore difference 
groups involved in mutual problem solving when t 
with a human partner or (b) "set" 


5 that might exist between 


hey are (а) “set” to work 


to work with a machine as a partner. 
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Participating in the experiment were 24 men and 24 women; half of each 
sex were run under each experimental condition. 

| The problem involved the use of a plugboard containing 25 squares arranged 
in a five-by-five configuration, with one square designated as the goal square. 
The object was to achieve the goal in as few moves as possible with each 
subject alternating moves with his "partner" or the “machine.” The perfect 
solution in Task I required eight moves and in Task II, seven moves. 

Three measures of performance were taken: (а) number of moves remain- 
ing to be made after the minimum required had been exhausted, (5) patterns 
that developed in moves toward the goal square, and (c) accuracy of predic- 
tion of the “partner’s” next move. The moves-remaining score showed no 
significant differences that were associated with the contrasting "human" vs. 
"machine" sets. The three indices of patterning of moves showed that the 
subjects with the “human” sets did not use routes to the goal that differed 
from those used by subjects with the “machine” sets. The subjects participat- 
ing in one condition were not better able to predict the next move of their 
“partner” than those subjects participating in the other. 

An attitude questionnaire did reveal that those subjects who had been given 
the “human” set considered their partners as more confused than did those 
working under the “machine” set. 


Although the results of this experiment cannot be generalized to all tasks, 


they suggest the existence of important structural components in certain tasks 


that are not dependent upon the social character of group problem solving. 
Much further study of these components will probably be required before we 
г understanding of their significance in relation to social and 


can attain a prope 1 lvi 
em solving. 


Personal variables that are also involved in group prob 
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CHARACTERISTICS OF NURSES IN SHORT- 
AND LONG-TERM TREATMENT SETTINGS* 


Department of Psychiatry, D. C. General Hospital 


Joan К. BorucHow?? 


A. INTRODUCTION 


As the service professions (such as medicine, nursing, and social work) have 
become increasingly specialized, the traditional stereotype of members of these 
Professions as warm, giving, interested in people, and so forth has perhaps 
less relevance than was the case 10 or 20 years ago. A comparative study of 
neuropsychiatric and general medical-surgical nurses by Navran and Stauf- 
facher (4) is illustrative of recent efforts to identify personality character- 
istics associated with subspecialties in these professions. They found general 
medical-surgical nurses to be significantly more orderly, deferent, and self- 
abasing and less introspective and aggressive than neuropsychiatric nurses. The 
interpretation was that the general medical-surgical nurses are more work 
oriented, impersonal, and less able to direct others than are their psychiatric 
colleagues, who place greater emphasis on interpersonal relationships. How- 
ever, there is some difficulty in interpreting these results as the authors do not 
Specify whether long-term and short-term treatment settings were equally 
Tepresented in their samples. Consequently, the differences may reflect differ- 
ences in intensity and duration of patient contact irrespective of the type of 
an effort was made to compare and contrast 
term and short-term treatment 
uropsychiatric and 


nurse, In the present study 
Characteristics associated with nurses in long- п 
Settings and characteristics associated with nurses in ne 


general medical-surgical settings. 
В. METHOD 


1. Subjects 
tative of four types of treat- 


Subjects were 160 registered nurses represen OF ue БЫ 
Psychiatric units in receiving 


ment settings, Each setting included 40 nurses. 


— 
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hospitals? and state mental hospitals* represented the short-term and long-term 
settings in psychiatry respectively. The operating room? represented the short- 
term general medical-surgical setting. В Е А | 

As medical-surgical treatment of tuberculosis? and orthopedic’ patients in- 
volves prolonged care and long contact, the nurses from these areas were 
combined to represent the long-term general medical-surgical setting. The 
orthopedic and tuberculosis groups were compared on background and per- 
sonality variables to see if they were homogeneous and could be combined. 
These tests revealed significant differences on three of the background vari- 
ables and one of the personality variables, Consequently, for these four 
variables, the orthopedic and tuberculosis groups were not combined in testing 
differences among the various nursing specialties. 

The nine background and 42 personality variables were incorporated into 
a five-part, self-administering questionnaire in booklet form. The forms were 
mailed to the director of nursing of each hospital for distribution to the staff 
nurses. To assure anonymity the booklet listed a code number to identify the 
subject and setting, with a request to omit the subject's name. To discourage 
the tendency to give socially desirable responses, the subject was instructed to 
mail the forms directly to the author instead of returning them to the director 
of nursing. 

2. Measures 


4. Background variables. "These were age, race, marital status (married 
or other), education, earnings, intent to remain, length of employment, and 
contentment with present position. 

b. Personality variables. These were as follows: (a 
16 items describing common physical complaints, (5) 
items descriptive of emotional complaints [V 
adapted from Frank’s Symptom Distre 
the Sociability Scale of the Guilford- 
democratic attitude—25 items adapt 


) somatic complaints— 
psychic complaints—24 
ariables (а) and (b) were 
ss Scale], (c) sociability—15 items from 
Zimmerman Temperament Survey, (4) 
ed from the Adorno-Levinson F Scale, 


3 Crownsville State Hospital, 
Center, St. Louis, Mo.; 
pital, Springfield, Md. 

4 Crownsville State Hospital, Crownsville, Md.; Eastern Shore State Hospital, 
Cambridge, Md.; Kentucky State Hospital, Danville, Ky.: Saint Elizabeths Hospital, 
Washington, D. C. 

5 George Washington Hospital, Washington, D. C.; Georgetown Hospital, Wash- 
ington, D. C.; Providence Hospital, Washington, D. C); Washington Hospital. Center, 
Washington, D. C. 

9 National Orthopedic and Rehabilitation Hospital, 
Orthopedic Hospital, Gastonia, N. C.; Suburba 


Crownsville, Md.; 


› Malcolm Bliss Mental Health 
Spring Grove Hospital, Croy 


wnsville, Md.; Springfield Hos- 


Arlington, Va.; North Carolina 


n Hospital, Arlin "ton, Va. 
7 Glenn Dale Hospital, Glenndale, Md. à кышу ча 


JOAN К. BORUCHOW 63 


(e) subservience to M.D., (f) egalitarianism, (g) impersonal attitude, (Л) 
task orientation, (i) status distance, (j) belief in personal efficacy, (4) belief 
in efficacy of nonmedical personnel, (/) personal contact and involvement, 
(m) belief in order and control [Variables (e) through (m) included from 
four to seven items and were adapted from the National Institutes of Health 
Nursing Attitude Survey], (л) autocratic, (о) distrustful, (5) overconven- 
tional, (4) exploitive, (r) self-effacing, (s) hostile-aggressive, (2) dependent, 
(и) overgenerous, (v) managerial, (w) cooperative, (x) self-reliant, (у) 
forthright, (z) accepting, (aa) responsible [Variables (л) through (аа) 
included from four to 12 items adapted from the Leary Interpersonal Check 
List (3) ; Variables (2) through (и) included positive statements, and vari- 
ables (v) through (aa) were negative], and (bb) self-ideal nurse agreement 
[the 128-item Leary Interpersonal Check List was rated for the "Self" and 
again for the “Ideal Nurse"]. Phi coefficients used as a measure of agreement. 

Analyses of variance and chi-square tests were computed to test for differ- 
€nces among the four settings on the background variables. Covariance analy- 
ses, controlling for the tendency to give socially desirable responses (2), were 
computed to test for differences among the four treatment settings on the 


personality variables. When there was an overall significant difference between 


groups, Duncan's multiple-range test was employed to determine where the 


£roups differed. 
C. RESULTS AND DISCUSSION 


The two general medical-surgical groups differed from the two psychiatric 
Sroups on only one variable: egalitarianism. This variable included items re- 
flecting class and status distinction among hospital-staff members and among 
Patients and hospital staff. Psychiatric nurses scored higher than the general 
medical-surgical nurses, indicating a more democratic attitude. 

Nurses in short-term and long-term settings also differed significantly on 
Only one variable; nurses from the short-term settings perceived the "ideal 
nurse” as more dependent than did nurses in long-term settings. — 

In contrast to the minor differences between nurses in psychiatric and gen- 


eral medical-surgical settings and betwee 


n nurses in long-term and short-term 
treatment settings, there were seven significant differences between the groups 
5, 
Tepresenting the extremes in intensity ап 


d duration of patient contact: i.e., 
nd nurses from state mental hospitals. Operating- 


dependent than did the other three 
ackground variables that showed 
and indicated least 


the Operating-room nurses à 
room nurses rated themselves as more 
Broups, a factor that can be related to the b j 
[see Table 1] that although they are the lowest salaried 
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intention to remain in their present position, they have remained in their posi- 


tion on the average of 34 months. 


The personality variables (listed in Table 2) that can be loosely interpreted 


as interpersonal-relationship measures show that operating- 


room nurses per- 


ceived themselves as more introverted and place less importance than others 


TABLE 1 


BACKGROUND VARIABLES THAT SIGNIFICANTLY DIFFERENTIATED AMONG 


THE Four Nursinc Groups 


General 
medical-surgical 
Neuropsychiatric nurses nurses 
Short Long Short Long 
Variable term term term term F 
Age (in years) 30 35 32 44 10.68** 
Бе je cent white) 33 88 93 98 69.77**2 
Married (per cent) 67 70 55 52 ЫҢ” 
Salary (in dollars) $4450 $4350 $3820 $4999 3.83* 
Supervisory 
nurses (per cent) 50 68 20 48 14.25*2 
Intention to remain 
(in months) 23 24 15 25 341* 
Months in position 26 30 34 55 2.80* 


* Significant at the .05 level. 
** Significant at the .01 level. 
a Chi square. 


TABLE 2 
MEAN Scores or PERSONALITY VARIABLE: 


AMONG THE Four NURsING Groups 


S THAT SIGNIFICANTLY DIFFERENTIATED 


General 


Neuropsychiatric nurses 


medical-surgical 


x 4 ё nurses 
ort on, ort Lon 

Variable term Gan term nes F 
Subservience to M.D. 11.70 10.90 12.60 1142 7940" 
Egalitarianism 17.50 17.48 16.20 16.19 5.51** 
Belief in йоду of nonmedical 

personne 12.90 12.9 07** 
Autocratic (self) 1.10 115 E и озне 
Overconventional (self) 1.81 1.34 2.31 172 2.93* 
Self-effacing (self) 3.14 247 4.75 3.89 8.35** 
Hostile (self) 1.90 1.94 2.86 1.69 3.78* 
Dependent (self) 3.85 3.23 5.54. 4.33 10.12** 
Accepting (ideal) 3.44 3.05 3.65 3.29 3.26* 
Autocratic (ideal) 1.74 117 1.37 71 2.76* 
Dependent (ideal) 3.56 2.60 345 2.84 345* 
Self-ideal nurse agreement (phi) .62 .69 .53 156 8.76** 
Democratic attitude 69.64 65.34 61.96 64.25 4.42** 


* Significant at the .05 level. 
** Significant at the .01 level, 
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on interpersonal relationships. Their ratings were highest on the personality 
variables, “‘overconventional” and “hostile,” suggesting less freedom and satis- 
faction in their interpersonal relationships. In addition, they saw themselves 
85 more self-effacing than did the other three groups and were lowest оп self- 
ideal nurse agreement. The latter result is consistent with Raskin’s findings 
on psychiatric outpatients, who rated themselves as less self-satisfied, sociable, 
and assertive and had poorer self admired-other agreement scores (5). Operat- 
ing-room nurses were also lowest in democratic attitude, indicating greater 
need for direction from authority figures and less desire for self-expression and 
authority in their own positions. This finding relates also to their high sub- 
Servience-to-M.D. scores. 

Long-term psychiatric nurses were opposite to their operating-room col- 
leagues on dependency and subservience to M.D. They also placed more 
value on democratic thought (interpreted as less attachment to authority and 
More on self-expression), with much interest in interpersonal relationships. 
They were highest on self-ideal nurse agreement, presenting a higher self- 
concept and perhaps indicating more satisfaction with their chosen profes- 
ston (1), 

There were also a few significant characteristics associated with nurses in 
tuberculosis and orthopedic settings. The tuberculosis nurses were older, had 
been employed in their present positions longest (mean of 67 months), and 
intended to remain in their positions longer than did the typical member of 
each of the other groups. Orthopedic nurses differed from nurses in other 
Settings on only one variable; they were lowest in belief in efficacy of non- 
medical personnel. However, considering the number of tests of significance, 
Caution should be exercised in interpreting these differences as they could occur 


by chance alone. 

In sum, there were many characteristic di y | 
Extremes of a continuum representing intensity and duration of interpersonal 
Contact with patients: i.e., the operating-room and the long-term psychiatric 
nurses, Further, these differences were in the direction predicted by the Navran 
and Stauffacher (4) study. However, there were no differences between 
ÉrOups representing less extreme positions on this continuum (i.e, between 
short-term psychiatric nurses who have relatively intense interpersonal contact 
With patients for short periods of time and long-term general medical-surgical 
Nurses who have less intense interpersonal contacts with patients for longer 
Periods of time). In other words, categorizing nursing settings on the basis 
of both intensity and duration of nurse-patient contact apparently provides a 
More meaningful distinction in terms of personality attributes of nurses than 


fferences between nurses on the 
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the distinction between psychiatric and general medical-surgical nurses per se. 
In this regard it would be of interest to know if comparable results would 
obtain if members of other service professions were similarly categorized on 
the basis of intensity and duration of patient contact. For example, do 
Travelers Aid Society social workers, who usually have brief and limited 
contact with clients, differ from social workers in state mental hospitals? and 
are these differences in personality attributes in the direction predicted from 
the results of the present study? 


D. SUMMARY 


Background and personality characteristics of 160 psychiatric and general 
medical-surgical nurses in short-term and long-term treatment settings were 
compared and contrasted. There were no marked differences between general 
medical-surgical and psychiatric nurses generally or between nurses in short- 
term and long-term settings. The major differences were between the two 
extreme groups on the dimension of intensity and duration of patient contact: 
i.e., operating room and state-hospital psychiatric nurses. As compared to state- 
hospital psychiatric nurses, operating-room nurses were more dependent, sub- 
missive to the physician, had lower self-concepts and preferred being directed 
rather than assuming authority themselves, In contrast, psychiatric nurses 
placed more value on interpersonal relationships and were more outspoken and 
extroverted than were their operating-room colleagues. 
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SOME PERSONALITY DIFFERENCES BETWEEN DOGMATIC 
AND NONDOGMATIC GROUPS*? 


Department of Psychology, San Jose State College 


Warrer T. PLANT, CHARLES W. TELFORD, AND JosepH А. THOMAS 


A. INTRODUCTION 


The purpose of this paper is to present the results of analyses directed 
toward determining how persons who differ markedly in measured dogmatism 
differ in certain other respects. The Rokeach Dogmatism Scale: Form E 


was the measure used to select extreme groups for this purpose. 
In the monograph Political and Religious Dogmatism (13), Rokeach 
thoritarianism or dogmatism. Since 


Published a 40-item measure of general au 
its publication, the Dogmatism Scale (hereinafter referred to as the D Scale) 
has been and continues to be used extensively in social-psychological research 
and particularly in research dealing with attempts to measure the non- 
intellectual effects of a college education (+, 8, 12, 15). The most compre- 
hensive summary of work with the scale to date is found in Rokeach’s book 


The Open and Closed Mind (14). 


In the development of the D Scale, } cn 
this scale is less loaded with a left-right or liberal-conservative political 


dimension than is the familiar California F Scale, and it contains less avowed- 
Prejudice content than docs the California Ethnocentrism Scale (or E Scale). 
Intercorrelational studies of the D Scale and F Scale with measures of 
Political and economic conservatism have consistently yielded lower cor- 
relations for the D Scale than for the F Scale. An examination of scale 
Content yields the observation that the D Scale contains much less avowed- 
Prejudicial material or stated-prejudicial material than does the E Scale. 
Rokeach has also reported intercorrelational studies with the D, E, and F 
Scales from which he has concluded that the D Scale is a better measure of 
general authoritarianism than is the F Scale. Replications of certain of these 
correlational studies (11) with American subjects, as against Rokeach's 
English subjects, have reported general agreement with the Rokeach findings. 
Rokeach (13) has reported odd-e half reliability coefficients of 
Sy, 
бу, Received in the Editorial Office, Provincetown, Massachusetts, on April 1, 1964. 
parent, 1965, by The Journal Press. s a part of the research program under 


The d is $ obtained a 
Contract 914 gor шау ТЕ нов, Cooperative Research Branch (15). 


67 


systematic effort demonstrated that 


ven split- 
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.81 for a small sample of English college students and 78 for a sample of 
English workers. Plant (12) has reported odd-even split-half reliability 
coefficients of .84 for a sample of 400 male college freshmen and 85 for a 
sample of 400 female college freshmen. The scale is sufficiently reliable for 
use in research dealing with group differences. 

Peabody (10) has reported extensive work on response set and the D Scale. 
He used the D Scale, the F Scale, an anti-Semitism scale, a conservatism 
scale, and reversed scales for each. Peabody, using both English and American 
samples of subjects, has concluded that lew scores on the D Scale represent 
rejection of dogmatism; whereas high scores represent a combination of 
“agreement set” or acquiescence and acceptance of dogmatism. . 

"There have been studies aimed at determining how groups of subjects with 
extreme scores on the D Scale differ also in other respects. Rokeach (14) 
demonstrated that persons with relatively high D-Scale scores solve complex 
laboratory problems much less well than do persons with relatively low 
D-Scale scores. The problems that have been used require that a subject be 
able to overcome sets of beliefs about the nature of things that ordinarily are 
used in complex-problem solution and to adopt new approaches to the prob- 
lem and its solution. Fillenbaum and Jackman (3) 


Rokeach studies with subjects having high and low D-S 
same complex 
In addition, 


have replicated the 
cale scores using the 
-problem task and have obtained essentially the same results. 
these investigators have demonstrated that subjects with low 
scores on the D Scale also have relatively low scores on a measure of anxicty 


and that those with high scores on the D Scale have high scores on the anxiety 
measure. 


It would seem that the D Scale is a reasonably 
authoritarianism or dogmatism and is one for wh 
available about subjects who score relatively 
is to add to the information on subjects wh 
low in dogmatism as measured by the D Sca 


good measure of general 
ich some information is 
high or low on it. Our intent 


о are high and those who are 
le. 


B. Measures UsEp AND METHODS or ANALYSIS 
1. Measures Used 


In the summer and fall of 1960, a battery of psychological tests was given 
to all persons indicating an intent to enroll for the academic year 1960-1961 
as freshmen in one of six California public junior colleges (15). The total 
number of subjects for whom usable test results were obtained was 4,506 
(2,643 males and 1,863 females). All sub 


jects were given the special battery, 
a scholastic-aptitude measure, and various achievement measures used in the 
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several colleges for placement purposes. The special battery included the 
following group measures: | 

1. The Rokeach Dogmatism Scale: Form E (13). 

2. The Modified California Psychological Inventory (5), hereinafter 
referred to as the CPI. The complete Gough inventory includes 18 scales. 
The modified inventory includes five of these scales taken intact and printed 
into a new booklet. The five scales, chosen on the basis of local factor- 
analysis studies (6), were the sociability (Sy), self-control (Sc), achievement 
via independence (Ai), intellectual efficiency (Ie), and responsibility (Re) 


scales, 
3. The Allport, Vernon, and Lindzey Study of Values (1) hereinafter 
гп scale yields scores in the follow- 


referred to as the AVL. This widely know 
ing value or interest domains: theoretical, economic, aesthetic, social, political, 
and religious. 

The scholastic-aptitude measure was 
(SCAT): Form A, or another similar mea: 
converted to SCAT total raw scores by a conversion 
Psychological Corporation (2); 


the School and College Ability Test 
sure, the scores for which were 
procedure offered by the 


2. Methods of Analysis 


Two sets of statistical analyses were made of the data from the test battery 
for extreme groups on the D Scale. For the analyses of Study 1, the top and 
bottom 10 per cent of the scores on the D Scale were selected from the sample 
of 2,643 males and from the sample of 1,863 females. Means and f ratios 
for the five CPI-scale scores, the six AVL-scale scores, and the scholastic- 
aptitude-scale score were computed for the high D-Scale score males vs. the 
low D-Scale score males and for the high D-Scale score females vs. the low 
D-Scale score females. Separate analyses for males and for females were 
undertaken because of the repeated finding of significant sex differences on 
the D Scale (8, 12, 15), on the CPI scales (5), and on the AVL scales (1). 


For the analyses of Study II, 110 pairs of males and 110 pairs of females 
bers of the pairs were matched within one 


Were selected for study. The mem 
raw-score point on the total raw score from the SCAT measure, but one mem- 
5 per cent on the D Scale while 


ber of each pair had a score from the highest 1 
from the lowest 15 per cent on the 


the other m Май = 

ember of the pair had a score 
D Scale. Statistical tests of t i f the difference between cor- 
related means were undertaken matched pairs of males and for 
the 110 matched pairs of females on the five CPI-scale and the six AVL-scale 
Scores, 
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C. RESULTS AND Discussion 
1. Study I 


The results of the tests of the significance of the difference between sample 
means for the five CPI scales and the SCAT total raw scores for low D- 
Scale males vs. the high D-Scale males are given in Table 1. The same kind 
of results for the same variables for females with low D-Scale scores vs. 
females with high D-Scale scores are given in Table 2. 

Each of the comparisons between dogmatic males and nondogmatic males 
reported in Table 1 yields a mean difference significant beyond the .01 level. 


TABLE 1 
CPI AnD SCAT Comparisons BETWEEN Mates Low IN MEASURED DOGMATISM 
AND Mates HIGH IN MEASURED Docmatism 


Low High 
dogmatism dogmatism 
(N = 264) (N = 264) 
Variable Mean SD Mean SD Difference t 
D Scale, Form E 119.3 12.5 213.9 11.8 
California Psychological 
Inventory 
Sociability (Sy) 25.1 5.1 22.2 47 2.9 6.88* 
Self-control (Sc) 284 8.2 21.6 7.7 6.8 9.86" 
Achievement via 
independence (Ai) 20.4 4.6 12.8 3.8 7.6 20.68* 
Intellectual efficiency (Ie) 37.9 6.2 30.1 6.3 7.8 14.40° 
Responsibility (Re) 29.1 5.4 258 5.2 3.3 7.16" 
School and College Ability 
Test: Total 68.1 18.7 52.1 15.1 16.0 10.78" 
* $« 0l, 


CPI лхр SCAT Comparisons BETWEEN FEMALES Low iN Measurep DoGMATISM 
AND FEMALES HIGH IN MEASURED 


Docmatism 
Low High 
dogmatism dogmatism 
— (N — 186) (N — 186) 
ariable Mean бр Mean SD Difference t 
D Scale, Form E 111.1 11.8 
California Psychological aie 128 
Inventory 
Sociability (Sy) 25.1 5.5 21.6 6.7 5 
а . i 3.5 5.50 
Self-control (Sc) 30.2 8.1 24, ж 
Achievement via В id i мај 
independence (Ai) 21.2 3.9 14.0 36* 
Intellectual efficiency (Je) 385 5% Bi X 25 Из 
Responsibility (Re) 321 456 28.5 54 3.6 — 683* 
School and College Ability J P 
Test: Total 623 191 478 167 15.0 8.07* 
* $« 01. 
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Without exception, the nondogmatic males had the highest means on the five 
CPI variables and on the SCAT variable. 

Each of the comparisons between dogmatic females and nondogmatic females 
reported in Table 2 yields a mean difference significant beyond the .01 level. 
Without exception, the nondogmatic females had the highest means on the 
five CPI variables and on the SCAT variable. 

The results of the tests of the significance of the difference between sample 
means for the six AVL scales yield one statistically significant difference for 
males and one for females. The nondogmatic males had a significantly lower 
mean on the AVL religious scale than did the dogmatic males. The non- 
a significantly higher mean on the AVL aesthetic 
In these two instances of statistical 
сеп sample means are small relative 
viations for the means. In addition, 


dogmatic females had 
scale than did the dogmatic females. 
Significance, the score differences betw 
to the average of the two standard de 
When controls on ability differences were made for the analyses of Study II, 
none of the AVL differences between means for dogmatic males vs. non- 
dogmatic males or for dogmatic females vs. nondogmatic females was signifi- 
cant, In light of these considerations, no empirical results for the AVL com- 


Parisons will be reported. 
2. Study II 


The analyses made for Study I were repeated for samples of subjects high 
ability-test score because in Study I 


and low on the D Scale but matched on 
the differences between high and low dogmatics on the SCAT total raw score 
Were substantial. In addition, studies of very bright vs. very dull subjects (9) 


report significant differences on CPI v les, with the very bright subjects 
having the highest means; and studies of very bright subjects (7, 16) report 
relatively high mean scores on the CPI scales. The purpose of these analyses 
Was to determine if there were personality differences between those high 
and those low in measured dogmatism but alike in measured scholastic 
aptitude. 

The results of the tests of the significance of the difference between cor- 
related sample means for the five CPI scales for low D-Scale males vs. high 
D-Scale males but matched on SCAT scores are found in Table 3. The same 
kind of results for the same variables for females with low D-Scale scores 
vs. females with high D-Scale scores but matched on SCAT scores are found 


in Table 4, 


Each of the comparisons reporte 
nondogmatic males who were ma 


ariab 


d in Table 3 between dogmatic males and 
tched on ability-test scores yields a mean 
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TABLE 3 
CPI Comparisons BETWEEN MALES Low IN MEASURED DOGMATISM AND Mates 
HicH Іх Measurep Юосматіѕм Вот MATCHED ON ABILITY 


Low High 
dogmatism dogmatism 
(N = 110) (N — 110) r 
Variable Mean SD Mean SD Difference 1 
D Scale, Form Е 115.7 14.8 211.9 12.9 
California Psychological 
Inventor: 
Sociabiliy (Sy) 25.5 52 21.9 47 3.6 347% 
Self-control (Sc) 29.7 7.5 21.7 7.8 8.0 7.72 
Achievement via 
independence (Ai) 20.7 42 14.0 3.9 6.7 14.44% 
Intellectual efficiency (Ie) 38.7 4.9 31.9 6.7 6.8 9,60. 
Responsibility (Re) 30.0 5.0 26.9 5.2 3.1 4.59 
* 5« 0l. 
TABLE 4 


CPI Comparisons BETWEEN FEMALES Low in MEASURED Docmatism AND FEMALES 
HIGH IN MEASURED Docmatism Bur МаАтснЕр on ABILITY 


Low High 
dogmatism dogmatism 
(N — 110) (N — 110) 
Variable Mean SD Mean SD Difference t 
D Scale, Form E 116.2 13.0 204.4 14.0 
California Psychological 
Inventory 
Sociability (Sy) 24.6 57 23.0 7.2 1.6 1.99* 
Self-control (Sc) 31.0 8.2 24.7 8.5 6.5 5.88** 
Achievement via 
independence (Ai) 20.2 4.2 15.3 4.1 4.9 9.50** 
Intellectual efficiency (Ie) 37.2 6.0 32.8 5.5 44 6.45%" 
Responsibility (Re) 315 — 4$ 29.3 60 22 3.29** 
* < 05. 
** $7 01. 


difference significant beyond the .01 level. Without exception, the nondogmatic 
males had the highest means on the five CPI variables, 

Four of the comparisons reported in 
dogmatic females who were matched 
difference significant beyond the .01 ley 
scale (Sy) yields a mean difference sig 
exception, the nondogmatic females h 
variables. 


None of the tests of the significance of the difference between sample cor- 
related means for high D-Scale males vs, low D-Scale males or for high 


D-Scale females vs. low D-Scale females on the AVL was statistically sig- 
nificant. 


Table 4 between dogmatic and non- 
оп ability-test scores yields a mean 
el. One comparison, on the sociability 
nificant beyond the .05 level. Without 
ad the highest means on the five CPI 
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2 


3. Discussion 


When comparisons were made of measured personality characteristics of 
persons with extreme scores on the D Scale, statistically significant differences 
were obtained on the five scales of the CPI and on the SCAT. This was true 
for samples of males and of females, and (without exception) the relatively 
nondogmatic subjects had the highest mean scores on the CPI and SCAT 
scales. Compared to highly dogmatic subjects, the nondogmatic subjects would 
be described as being outgoing and enterprising, calm and patient, mature 
and forceful, efficient and clear thinking, planful and responsible, and more 
likely to succeed in an academic setting than would the highly dogmatic 
subjects. These descriptive adjectives come from the descriptions of the CPI 
scales and the extensive validity studies conducted by Gough (5). The descrip- 
tion of the nonintellectual characteristics of those relatively free of dogmatism, 
compared with those high in dogmatism but alike in scholastic aptitude, is 
the same as reported earlier. Controls on the aptitude variable through the 
matched-pair procedure did not change the results. 

Although all the differences between the low dogmatics and high dogmatics 
on the CPI variables were statistically significant, the differences on the 
achievement-via-independence and intellectual-efficiency scales were most strik- 
ing. The mean differences on the first of the two aforementioned scales, with- 
out exception, were larger than the average of the two standard deviations of 
the means from which the differences were obtained. On three of the four 
Comparisons reported for the intellectual-efficiency scale, the same kind of 
difference relative to the standard deviations was obtained. 

A different type of comparison is provided between the CPI means obtained 
in this study and those obtained from Gough’s normative samples (5). In 
Tables 5 and 6, the means for the high-school subjects and for the college 


TABLE 5 
CPI Means rog Mate NORMATIVE SAMPLES AND Low-DoGMATISM SAMPLE 
HicH-DoGMATISM SAMPLE 


AND 
High- 
1 College В 
didents students Low High 
CPI Gough (5) Gough (5) dogmatism dogmatism 
Scale (N = 3572) (N — 680) (N — 110) (N — 110) 
Sociability (s 21.5 254 25.5 21.9 
Self-control (3 25.3 29.2 ZU, 2 
Achievement via 20.7 14.0 
Intudependence (Ai) 14.6 2 38.7 31.9 
ellectual efficiency (Ie) 33.6 315 30.0 269 


€sponsibility (Re) 26.7 
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subjects in Gough’s standardization samples are presented, as are the means 
on the CPI scales for the high dogmatics and low dogmatics of the present 
study matched on scholastic aptitude. Table 5 contains the relevant means for 
males; Table 6, for females. 


TABLE 6 
CPI Means FoR FEMALE NORMATIVE SAMPLES AND Low-DoGMATISM SAMPLE 
AND HicH-DoGMATISM SAMPLE 


High- 

school College 

students students Low High 
CPI Gough (5) Gough (5) dogmatism dogmatism 
Scale (N — 4056) (N — 2120) (N — 110) (N — 110) 
Sociability (Sy) 214 26.0 24.6 23.0 
Self-control (Sc) 27.6 30.8 31.0 24.7 
Achievement via 

independence (Ai) 15.5 21.9 20.2 15.3 

Intellectual efficiency (Ie) 34.4 414 37.2 32.8 
Responsibility (Re) 30.0 33.3 31.5 29.3 


The mean CPI scores for the junior-college freshman males who had very 
low scores on the D Scale were very similar to those of Gough’s college- 
student sample. The means for the freshman males who had very high scores 
on the D Scale were very similar to those of Gough’s high-school sample 
and, in some cases, were below the level of the high-school sample. These 
differences seem to indicate that males with high D-Scale scores are likely 
to be psychologically immature and, although college freshmen, not as mature 
as many high-school subjects. The same general type of interpretation can 
be made for the female freshman subjects who had very high scores on the 
D Scale. These comparisons indicate that the dogmatic college freshman 


would be described as being impulsive, defensive, and conventional and 
stereotyped in thinking, 


D. SUMMARY AND CONCLUSIONS 

A battery of psychological tests, including the Rokeach Dogmatism Scale 
(D Scale), five scales from the California Psychological Inventory (CPI), 
the Study of Values (AVL), and a scholastic-aptitude test (SCAT), was 
given to a large sample of entering junior-college freshmen. 

In order to study and to describe some of the ways in which highly dog- 
matic persons differ from nondogmatic persons, extreme groups on the D 
Scale were chosen. Comparisons for these extreme groups were made on 
CPI, AVL, and SCAT scores. In addition, the comparisons on the CPI 
and AVL were made for samples who had extreme scores on the D Scale 
but who were matched on aptitude-test score, 
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V : Я : 
Nondogmatic males compared with dogmatic males and nondogmatic 


females compared with dogmatic females were found to differ significantly 
on each of the five CPI scales. Without exception, the nondogmatic groups 
had the highest means on the CPI scales. It was concluded that highly 
dogmatic subjects were psychologically immature and could be characterized 
as being impulsive, defensive, and conventional and stereotyped in thinking. 


16. 
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A. INTRODUCTION 


The investigation of culture patterns by anthropologists and historians has 
long proceeded by methods that may be illustrated at their best by the writings 
of Benedict (1), Mead (20, 21), Keyserling (18), Toynbee (24), and others. 
A radically new approach, involving quantifying of observations and basing 
theory on an explicit, mathematical model was proposed by Cattell in 1951 
(10) ; and it has generated concepts in studies by his associates which show 
the fruitfulness of the approach (6, 12). This new methodology (a) has 
taken nations as its.organisms for correlation entries, on the thesis that these 
alone have sufficient autonomy for that internal feedback which guarantees 
the organic consequences of culture practices and thus makes factors meaning- 
ful; (b) has taken a wide array of variables—psychological, historical, eco- 
nomie, sociological—on the thesis that causal lines do not respect academic 
departments; (c) has distinguished three panels of variables—syntality, 
Structure, and population—believing that these are closely connected by a 
stated formula (5); and (4) has employed simple-structure rotated factor- 
analytic solutions to discover independent, functionally unitary dimensions of 
national syntality, on the belief that the basic hypotheses in this area are 
best generated from correlational observations in detachment from armchair 
Prejudices. | 

Among the 12 or more independent dimensions initially appearing in 
Cattell’s study following this methodology (3, 10) was one tentatively 
designated “Poor Cultural Integration and Morale vs. Good Internal Moral- 
ity.” It loaded high death rate from syphilis, many deaths from alcoholism, 
many deaths from typhoid, tuberculosis, and other diseases brought under con- 
trol in well-integrated communities, presence of restrictive laws against di- 
ee Massachusetts, on April 2, 1964. 


concerning the interpretation of some 0 
Pr a is investigation was U 
€doctoral Fellowship from the Nationa 
ublic Health Service, at the University 0 
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vorce, and some less interpretable, lower variables, such as many miles of rail- 
road per person. Some further light was thrown upon it by the P-technique 
factor-analytic studies (correlating time series over 100 variables for the 
factors in the country’s historical development) by Gibb (12) for Australia, 
Cattell (4) for Great Britain, and Cattell and Adelson (6) for the United 
States. These showed a pattern tentatively called “Cultural Integration vs. 
Slum Standards” which similarly opposed community conscience and integra- 
tion to carelessness, squalor, crime, and high disease rate. Although the 
samples and variables are not so exactly carried over from study to study as 
in the foregoing set of researches, the work of Hofstaetter (16) and Thorn- 
dike (23)—correlation clusters only—confirms a similar organic aggregate 
of measurable variables covering a variety of crime manifest 


ations, absence 
of civic conscience (indeed 


› Positive evidence of civic corruption), and substi- 
tution of crude, rigid, external controls for individual conscience, From these 
several studies, it is hypothesized that the present study will r 
factor and find the interpretation of it supported by variables not previously 
included. It is also expected that evidence relating to alternative hypotheses, 
such as its being a health-related or slum factor, will be forthcoming. 

The interpretation includes morale of the community as well as its morality 
on essentially two counts: (a) the nature of many of the variables that show 
at the pole of high morality a relatively well-integrated culture (e.g., lack of 
slum conditions) in which the members are interes 
uphold community standards and (b) research on smaller groups which has 
found that groups with high morale have high morality and low morale 
groups have low morality: €.£, Hartshorne and May (15). Unfortunately, 
the present investigation is limited in the clarification of this aspect, due to 
the nature of the variables available, 

The interpretation we are seeking to clarify concerning moral behavior of 
individuals is in the sense of practicing values agreed on by the group which 
also transcend individual satisfactions; it would be one of several influences 
contributing to group synergy. [As seen through the education-affluence factor 
(3), educational influences are only related in a limited way.] The greater 
part of its variance may well arise from differences in willingness to accept 
ethical standards, but it nevertheless would be suspected of influencing а 
wide range of social behavior, including an ultimate effect on what com- 
munities are ready to pay for social services, as shown by loadings reflecting 
typhoid and tuberculosis control. Contingently, it justifies definition as that 


in the individual which generates community conscience and responsibility 
and which influences national statistics accordingly, 


eplicate the 


ted in doing their share to 
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In addition to the foregoing, a further goal of the study will be to replicate 
major previously found factors for two reasons: (a) the analytical procedure 
to be pursued is that of factor-analytic simple structure which necessitates an 
adequate number of nonrelated variables to determine exactly the hyperplane 
of the factor under investigation independently of the variables related to the 
hypotheses and (b) the interrelationships of the present factor with other 
major dimensions is an important clarifying process. When one does find the 
factor replicated and clarified in the same total context as previously found, 
the chance of this being an artifact of the present research or its methods is 
decreased. 

B. DEFINITION oF VARIABLES 
into three parts, each of which follows a 


separate criterion for finally selecting the variables to be used from among 
all those possible. The first category, primary-marker variables (variables 
numbered 1 to 13 in this study), is that containing variables found in previous 


research to load a factor of morale, morality, and cultural integration and 
15 used in this study to interlock it with previous concepts emerging for this 


factor. A second class of variables, numbered 14 to 29, consists of markers 
Which have loaded other factors in previous studies and are included here 
not only as “hyperplane fodder” for good rotation of the morality factor, 
but also so that other factors which have appreciable influence on the primary 
marker variables will be recognized, if they occur. 

The third class of variables covers the alternative directions in which we 
have speculated about the nature of the morale and morality factor per se. 
These variables, numbered 30 to 48 in the matrix list, are thus aimed to shed 
new light on the nature of the factor being investigated. The variables were 
chosen on the basis of their hypothesized importance and availability for all 
the countries. Many variables other than those in Table 1 were considered 
but were not included in the final analysis because of insufficient data avail- 
able or because the ones included were thought to be more important. Un- 
fortunately, in the end, a majority of our more “choice” variables were m 
Teported by a large number of countries and therefore could not be included.’ 


The variables finally included, al 
them, are as follows: 


The data collection is divided 


ong with pertinent information about 


1. Syphilis death rate. The original raw-score distribution was posi- 
a 4 
8 X5 Б 
The criterion for dropping а 


с : ^ 
orrelations, the matrix of principa 
mation are given in Gorsuch (13). 


a list of other variables, the 


or country, ^ T 
max solution, and other in- 
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* All decimal points dropped. 
** According to U.N. rating. 
*** Not available. 


TABLE 1 
RELIABILITY OF VARIABLES* 
i Full-length Stability Per cent 
Megane Title reliability coefficient "reliable"** 
Syphilis death rate 81 68 75 
i TB. death rate 87 77 75 
3 Birth rate 97 94 72 
4 Alcohol death rate 95 90 75 
5 Homicides and war death rate 84 73 70 
6 Miles of railroad per inhabitant 99 99 = 
7 Typhoid-fever death rate 96 92 75 
8 Death rate (gross) 94 89 75 
9 Marriage rate 95 91 84 
10 People per dwelling sss m one 
11 Population density 99 99 one 
12 Students per teacher 80 67 wan 
13 Suicides 97 97 70 
14 Area 99+ 99-- 99 
15 Cities per inhabitant eee vee was 
16 Population 994+ 99+ 69 
17 С1азһез 37 23 onk 
18 Literature and science prizes 70 53 oe 
19 Government ministries 86 76 dn 
20 National income per inhabitant 97 95 ае 
21 Telephones per inhabitant 99-- 99-- bod 
22 Science books 98 97 79 
23 Cancer death rate 81 69 75 
24 Buddhists ane s.. wae 
25 Heart death rate 91 84 75 
26 Political parties 88 79 nee 
27 Defense expeditures 97 93 ae 
28 Mohammedans Бы «ов e 
29 Catholics ы +. uae | 
30 Climatic efficiency ded “on ime 
31 Prostitution concern ses ow уе 
32 Cost-of-living trend sss oon ee 
33 Divorce rate 83 71 7 
34 Forced labor s.. +e s.. 
35 Hypertension death rate 74 58 70 
36 Illegitimate births 99 99 82 
37 International-aid index wae ^um wes 
38 News censorship 91 84 Мы 
39 Peace prizes 84 72 bal 
40 Physicians per inhabitant 97 94 94 
41 Population increase rate ne A жже 
42 Protestants ses one sum 
43 Religious books 89 81 77 
44 Riots and rebellions 76 62 ^d 
45 Students per inhabitant 93 87 se» 
46 UNICEF contributions per 
inhabitant ssr sss se 
47 Latin American country eee ssr жя» 
48 Extreme syphilis death rate eee ЕРА rks 
49 Random number eee one at 
50 Random number ++» +e: Ll 
51 Random number ses эзе +t | 
| 


Dainas 
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tively skewed; therefore the data were normalized before correlation, It 
has loaded the factor under study in five separate investigations. 

2. Tuberculosis death rate. Positively skewed, therefore normalized. 
Loaded the factor in three studies. 

3. Birth rate (gross). Slightly 
factor in three out of the four studies in which it appeared. 

4, Cirrhosis-of-the-liver (alcoholism) death rate. Slightly positively 
skewed; so normalized. Loaded the factor in three out of five studies. 

5. Homicide and war death rate. (Note inclusion of the latter; but if 
Richardson (22) is right, this contamination is not serious.) Positively 


zed. Loaded the factor in two out of three studies. 
Positively skewed; so 


bimodal; so normalized. Loaded the 


skewed; so normali 
6. Miles of railroad per 100,000 inhabitants. 


normalized. Loaded in two out of three studies. 
7. Typhoid-fever death rate. Positively skewed; so normalized. Loaded 


the factor in two studies. 

8. Death rate (gross). Positively skewed; so normalized. Loaded the 
factor in one out of four studies. 

9. Legal-marriage rate. Negatively skewed; so normalized. Loaded 
the factor in one study. | 

10. People per dwelling. Approximately normal; so normalized. Loaded 
the factor in one study. " 

Positively skewed; so normalized. Loaded the 


11. Population density. 
factor in one out of two studies. 
12. Students per teacher. Primary school only. Approximately normal; 
so normalized. Loaded the factor in one study. 
13. Suicide and self-inflicted-injury death rate. 
normalized. Loaded the factor in one out of three studies. ә 
14. Gross area in 10,000 km2 Positively skewed; so normalized. | 
15. Cities over 20,000 per million inhabitants. Rectangular distribution. 
16. Population (gross). In hundred thousands. Positively skewed; so 


normalized. . 
17. Clashes with other countries. Excludes wars; includes border con- 
flicts involving armies. Continuing conflicts were scored once for each 


year. Positively skewed; 


18. Nobel Prizes in literat 
zeros; so trichotomized. 


19. Government Ministries. 
20. National income per capi 


Positively skewed; so 


so normalized. 


ure and science. Positively skewed with many 


Approximately normal; so normalized. 
ta. In U.S. dollars. Positively skewed; so 


normalized. А. : 
21. Telephones per 10,000 inhabitants. Positively skewed; so nor- 

mali 1 
ized. titles in pure and applied 


22. Science books published. Number of 


sciences. Positively skewed with many zeros; 50 poca i 
23. Cancer death rate. Approximately normal; so normalized. — 
24. Proportion of the population Buddhist. Many zeros; so dichoto- 


mized. . adi 
25. Heart-disease death rate. All types except hypertension witl eart 
disease. Positively skewed; 50 normalized. 
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26. Number of political parties. Includes only those strong enough to 
elect candidates to national offices. Positively skewed; 50 normalized. 

27. Proportion of national budget for defense. Colonies given one-half 
the score of the mother country. Positively skewed; so normalized. 

28. Proportion of population Mohammedan. Many zeros; so dichoto- 
ee of population Catholics. Bimodal distribution. І 

30. Climatic efficiency. Huntington’s rating of the human-work efficiency 
of the climates throughout the world, given the present civilization. Based 
on optimum temperature (extremes and averages), rainfall, and number 
of storm fronts as indicated by research. Average for the inhabitants 
according to 1950 population distribution. Rectangular. 

31. Concern over traffic in persons and prostitution. Signers of the 
United Nations statement. Dichotomous. 

32. Cost-of-living trend. Per cent increase of 1958 over 1953. Positively 
skewed; so normalized. 

33. Divorce rate. Positively skewed with some zeros; so normalized. 

34. Use of forced labor. For economic or political reasons. Dichotomous 
distribution. 

35. Hypertension death rate, Approximately normal; so normalized. 

36. Proportion of births illegitimate. Positively skewed; so normalized. 

37. International-aid index. Includes all types of aid and loans in U.S. 
dollars. Base of 100. Less than 100 means aid received; more than 100, 
aid given. Approximately normal; so normalized. 

38. News censorship. Rated according to censorship of outgoing news 
reports. Positively skewed with some zeros; so trichotomized. 

39. Nobel peace prizes. Positivel 
omized, 

40. Physicians per 100,000 Inhabitants, Bimodal. 

^1. Population increase rate. Per cent increase of 1958 over 1953. 
Approximately normal with slight bimodality; so normalized. 

42. Proportion of population Protestant. Positivel 
zeros; so trichotomized. 


43. Religious books published. Number 
normalized. 


44. Riots and rebellions. 
widespread nature. Continui 
zeros; so trichotomized. 

45. Students per 1,000 Inhabitants. 
malized. 

46. Contributions to UNICEF per 1,000 Inhabitants. 
Positively skewed with some zeros; so normalized. 

47. Latin American Country. All Centra 
scored 1, all others scored 0. 

48. Extreme syphilis death rate. All countries w 
rate greater than an absolute one SD from the me 


ceived a score of 1; all others were scored zero, 


у skewed with many zeros; so trichot- 


у skewed with many 
of titles. Positively skewed; so 


Internal revolt or disorder of a serious and 
ng rebellions counted as two per year. Many 


Approximately normal; so nor- 
In U.S. dollars. 
l and South American countries 


ith a syphilis death 
an of Variable 1 re- 
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49. Random number. Positively skewed trichotomy. 
50. Random number. Rectangular distribution. 
51. Random number. Normal distribution. 


The data in this study were collected from a period not overlapping that 
from which the previous definition of the morality factor was obtained (i.e., 
for the period of 1953 to 1958, with a few instances going back as far as 
1947); but, of course, going back further for variables such as the total 
number of Nobel Peace Prizes. Data for each variable were, if possible, 
collected twice: once at the beginning of the time period and once at the end. 
Not only does this increase the stability of the results, but (what is more 
vital) it allows the calculatién of stability and reliability coefficients (see 
Table 1). The United Nations has also evaluated the data published by its 
committees and has indicated the figures thought not to be reliable or to be 
inaccurate. In Table 1, the per cent of each variable which the United Nations 
indicated as being of lower reliability than the rest can also be found. The 
only variable of questioned reliability, in this sense, is number 17; but it was 
retained because of its importance as a marker variable. 


C. THE SAMPLE OF CouNTRIES 


Because of the possibility of capitalizing on chance in a correlation matrix, 
the common rule of thumb is to have at least three or four times as many 
Observations (in this case, nations) as variables. To avoid difficulties from 
the number of variables approaching the number of cases in the special case 
when the total population of cases from which to draw is limited, one has two 
Options: (a) including a small number of variables in the analysis, repeating 
the study several times with new extension variables each time, and comparing 
the results from several studies or (b) including all of them in one matrix and 
avoiding the problem of factor matching and crossinterpretation. The same 
number of variables is eventually involved in either case, except the latter 


method is quicker and makes use of more information. It is obvious that the 
о base conclusions on, the more likely the con- 
h error entering into each bit of information, 
ons are drawn in each case. Also the 
Present study is a special case since the sample, although = ч шш, 
Contains virtually all of the population ; thus шшен е = : eet Д mel- 
ization problems (except, of course in terms of generalizing = hve в E 
ever, the reader must be cautioned not to draw wider conclusions than the 


Sample warrants 
ants. hi» 
: ; research by th 
Another caution and limitation 15 placed on the present res y the 


More information one has t 
clusions are to be correct, even wit о 
assuming the same number of conclust 
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nature of the variables. The data were usually reported by individual nations 
and many sources of error can enter, including chance miscalculation, defects 
in the data from improper sampling, differing definitions of the variables, and 
of course from motivation distortion through political aim or pride. 

The nations on which sufficient data were available are as follows: Argen- 
tina, Australia, Austria, Belgium, Brazil, British Guiana, Burma, Canada, 
Ceylon, Chile, China (Taiwan), Columbia, Costa Rica, Czechoslovakia, 
Denmark, Dominican Republic, Ecuador, Egypt, Finland, France, Germany 
(West), Greece, Guatemala, Hungary, Iceland, Ireland (Irish Free State), 
Israel, Italy, Japan, Luxembourg, Maltese Islands, Mauritius, Mexico, 
Netherlands, New Zealand, Nicaragua, Nigeria, Norway, Panama, Portugal, 
Puerto Rico, Rhodesia and Nyasaland, Spain, Sweden, Switzerland, Trinidad 
and Tobago, Union of South Africa, Union of Soviet Socialist Republics, 
United Kingdom, United States of America, Uruguay, 


end бЁ 509}, and Yugoslavia (а 


D. Tue ANALYvsis 


. The Pearson product-moment formula was used to calculate the correla- 
tions among the 51 variables. Its advantages for such work are that it utilizes 
the maximum amount of information and may be computed on any distribution 
that is fairly symmetrical and unimodal so long as it is not badly skewed. 
Applying the Pearson formula to the dichotomies and trichotomies gives 
comparable coefficients to those obtained from the other variables (14). The 
major disadvantage of the reduction of some variables to dichotomies and 
trichotomies is the lost information and therefore the greater inaccuracy of 
the coefficients; however including the variables in their original forms was 
not an acceptable option because of the extreme skewness of the variables 
(the usual transformations to eliminate skewness could not be applied because 
of numerous zeros that would be unaffected) ; therefore the present solution 
provides a practical approach to these variables, After examination of the 
distributions, the authors decided to transform the majority of th riables 
toa normalized condition which most of them closely usi d. S : ны lots 
were checked in regard to the assumption of linearity,4 The vien mE 
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then correlated with the aid of the Illiac digital computer; the resulting 1,275 
correlations are given elsewhere (13). 

The Guttman criterion of temporarily putting unities in place of com- 
munalities and stopping extraction when a latent root first drops below unity 
Was used to decide the number of factors. Communalities were then estimated 
iteratively for the 14 factors indicated. The unrotated matrix is given 
elsewhere (13) along with the residuals after 14 factors were extracted by 
the principle-axes method. Cattell’s definition (8) of Thurstone’s simple- 
structure criterion—the maximized total per cent of zero loadings—was used 
to direct visual rotations to a unique resolution. Analytical approximations 
were first tried (by both orthogonal and oblique programs), but were found 


to give poorer simple structure than the visually rotated solution. After seven 


hand rotations, reaching a plateau with a = 10 hyperplane count of 58 per 
Cent, it was felt that the simple-structure criterion had been met. The unrotated 


Matrix and the reference vector structure from which the hyperplane count 
Was made are given by Gorsuch (13). The primary-factor pattern, with the 
factors arranged in the approximate decreasing order of variance accounted 
for by each factor, is given in Table 2, and Table 3 gives the correlations among 
the factors, The loadings of the random variables gives an approximate level 
of "significant loadings.” All calculations were to at least five digits and 


Tounded only when placed in the tables. 


For the interpretative discussion in the | : i 
Est to set out the other factors as well as our main concern—the mora ity 


factor. Confidence in a factor rests on the framework of known factors about 


It also being replicated and, in this case, the obvious match of most of these 
dence in our main result. Evaluations 


to Previous fac а 9 a fi 

tors justifies higher con t 
ОЁ goodness of aie were made by the Cattell-and-Baggaley salient- 
Variable similarity index, s (7), the values of which are discussed under each 
factor. » 


following section, it has seemed 


ғ CULTURE PATTERN DIMENSIONS 


e and morality factor (Factor 3 in the 
h the original one so identified (3). Its 


Major-loading variables, given in rank order, are as follows (with adjectives 
describing the high cad of the factor and the number of the variable given 


in parentheses) : low syphilis death rate (1), low tuberculosis death rate (2), 
en me 


E. VERIFICATION O 


The pattern of the main moral 
Présent study) clearly matched wit 


cluded i iginal variable was kept in the analysis 
п i es, the origin Я d 
elided in the analysis, Та both casey ghe, o ро to аву ара cune ae 
Silinear relationships of Variable 1 are negligible. Varia И Variable 17 are limited. 
12е; therefore it Et that the curvilinear relationships for А 
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TABLE 2 (continued) 
Factor 

Variable 1 2 3 4 5 6 7 8 9 10 11 12 13 14 
33 41 15 44 —06 06 —22 —23 —50 —05 78 —01 —02 26 —37 
34  —4 —40 05 —45 —01  —105 —01 12 04 02 06 —38 —18 —10 
35 49 —50 08 —23 —03 07 03 03 —09 —20 19 43 —05 —10 
36 02 98 —55 —35 10 10 03 15 —46 80 09 —53 —09 —07 
37 17 —79 —03 —34 13 08 —31 14 —01 —30 —09 —34 —34 10 
38 —75 02 —18 02 10 —103 —11 —09 01 15 —07 —16 —11 —34 
39 64 —08 1 —02 38 42 —11 —15 18 44 21 —08 19 —51 
40 02 —15 —10 —15 —01 —18 05 —03 08 04 50 40 18 —05 
41 06 100 08 52 —03 13 —05 —09 00 17 33 03 23 —13 
42 94 27 56 —60 13 23 08 31 —05 50 —07 —10 09 —03 
43 —06 —79 —07 —11 52 13 —26 —34 20 52 04 02 10 —13 
44 —58 46 —11 03 28 00 —01 45 00 19 —27 18 —23 —37 
45 13 —87 —04 —16 —11 28 —10 —45 24 06 33 16 —37 39 
46 91 —06 —15 06 58 46 —04 —36 10 06 02 00 —11 —19 
47 02 17 —19 06 56 00 —10 28 —05 —06 42 —40 09 17 
48 09 10 07 15 08 32 —05 —05 —34 —39 —05 —22 —03 00 
49 —26 —19 —27 47 —22 —10 —03 02 —04 —44 —19 13 15 —03 
50 —28 29 32 21 04 13 04 50 58 06 —09 —14 08 07 
51 —15 —12 44 —03 01 —25 14 00 05 46 —04 02 —43 —19 
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TABLE 3 
PRIMARY-FACTOR CoRRELATIONS* 
Factor 
Factor 1 2 3 4 8 6 7 8 9 10 11 12 13 14 
1 
2 —60 
3 —48 52 
4 60 —49 47 
S „202 50 17—17 
6 —54 21 31 —н —28 
7 44 —39 —24 50 —21 —16 
8 36 —67 —54 51 —36 —10 39 
9 24 —38 —35 01 —20 07 —02 32 
10 69 —67 —54 64 —25 —Hu 48 59 30 
11 —35 35 13 —27 01 27 35 —20 —03 —22 
12 34 20 —01 16 45 —59 07 —30 —17 12 —10 
13 22 —42 —17 05 —19 —15 —07 26 20 09 —37 —12 
14 20 30 08 27 39 —48 22 —29 —39 11 09 60 —29 
* See text, pp. 85 to 95, for definitions of factors. 
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small percentage of population Catholics (29), large percentage of population 
Protestants (42), small proportion of births illegitimate (36), low alcohol 
death rate (4), large percentage of population Mohammedans (28), low 
homicide and war death rate (5), low typhoid-fever death rate (7), and 
high divorce rate (33). In the present study, it is being called Morality- 
Responsibility vs. Poor Cultural Integration. 

When this factor is statistically matched with Cattell’s® Factor 12 on 
20 comparable variables, the s index gives significant match (2 < .05). 
Beyond this, the “subjective” judgment of similarity is also strong. However, 
the factor has been simplified through several variables (6 and 10) not loading 
the present factor. These had led to an earlier hypothesis of the factor being 
related to slum conditions, but the connection is slighter now. (Incidentally, 
no importance should be given to the fact that the factor here ranks third 
now in terms of variance accounted instead of 12th as in Cattell since we 
deliberately limited the study here primarily to the variables in which this 
factor plays a major role.) 

There are several hypotheses that, from a superficial examination of the 
variables, could easily mislead the unwary. As Cattell (3, p. 466) earlier 
noted, the factor must not be confused with “health influences” for two 
reasons: (a) in this study, many variables such as death rate and physician 
rate load a distinct factor, whereas it would be imperative for them to load 
this one if the factor was concerned specifically with health measures, and (5) 
the death rates related to this factor are specifically only those that medical 
science can greatly alleviate provided the individual or society is sufficiently 
motivated to act. However, under conditions of low morale, the persons 
responsible and their community supporters do not act in accord with these 
responsibilities and the high death rates follow—not to mention a decrease in 
TUN s three death rates, two (syphilis and alcoholism) can be easily 
prevented through appropriate abstinence or through proper treatment avail- 
able in many of the countries studied ; therefore it appears that some lack 
the foresight, self-responsibility, and internalized controls to act in 3 
preventive manner. This, of course, may be the result of the culture's 
general approach to areas needing such responsibility. ; 

'The tuberculosis death rate is twice affected by a person's acceptance of 
responsibility. First, the incident rate is influenced by personal hygiene, 
particularly as it affects general resistance and the carefulness a person 


Se i in this section are to the 
А ences to previous factors in 
5 All matches with and refer except when otherwise noted. 


original factors as given in Cattell (3), 
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exhibits when in possible contact with the tubercle bacilli. Secondly, and more 
important for this factor, tuberculosis is a chronic relapsing disease, such that 
if a person does not strictly follow medical advice, a relapse and possibly death 
will result. (Note that the other factors have essentially removed all the 
variance in these variables attributable to economic and medical development; 
therefore, these two interpretations of these particular variables cannot be 
used here.) Thus, in this death rate there also appears to be a personality 
element involving the related concepts of internalized controls, accepting 
responsibility for oneself, and utilizing self-control to follow the best course. 
Similar interpretations can be given to many of the other variables. 

Other variables one would expect to load the factor under the interpreta- 
tion developed here usually have the correct sign and are out of the hyper- 
plane but not sufficiently so to be salient with a small sample. 

A question that immediately arises is whether this is a population character- 
istic of "general morality level," or a syntality characteristic of “morale,” 
where the members of the group uphold the standards because of the group 
atmosphere. These have been sometimes confused in many of the writings 
on group dynamics, where measures of individual verbal attitudes have been 
taken as identical with a group syntality measure. Granted that most 
theorists consider morality and morale distinct concepts, let us state our 
present hypothesis on this distinction. 

The analyses previously reported and the present one have not permitted, 
as confessed, anything like an adequate degree of hypothesis-directed control 
of the choice of variables in relation to morale hypotheses. Consequently, our 
theory of morale rests primarily on the small-group experiments. In writings 
on morale generally it has meant anything from euphoria to normality and 
from group effectiveness to conformity. As it derives from the Cattell and 
Stice, Gibb, and Lawson studies (9, 12, 19), it means primarily the magni- 
tude of total synergy (determining especially cohesion, the group resistance 
to its disruption) as contributed to largely by the distinct leadership (personal 
or ideational values) and social (gregariousness or "buddy" satisfaction) 
primary factors. At present the nature of the remaining three or so morale 
primaries is not interpreted, but the “fortitude” factor is shown by its correla- 
tions to be tied to qualities of personality (16 PF measures) in the popul 

Since small groups taken for experiments are always taken out of ] 
cultural groups and borrow their morality from the cultures of the latte arge 
the superego habits of the constituent members), the resulting Sues 
variance, complicated by the poor choice of variables used, may account for the 


ation, 
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experimental failure to find in small groups the morality factor found in 
large groups. Alternatively, however, one may argue that the distinctive 
leadership synergy factor is the small-group equivalent of large-group morality, 
provided we include leadership of an idea or value in the former. 

By either approach morality, as “population conformity to ideals,” is only 
one of the full range of factors that contributes to the group behaviors 
commonly included under the term “morale.” At the same time, acceptance 
by the population of values of self-sacrifice, the Ten Commandments, and 
other values of altruistic morality probably play a definite role in the success 
of the group and its resistance to disruption: i.e., to the totality of morale. 
This theory that despite the variations in religious form of universalist ethics 
those ethics have essentially developed as intergroup survival adaptations 
within large groups has been worked out more fully by Cattell (2). The factor 
found above in national groups clearly appears to correspond to this morality 
contribution to morale—not to all influences favoring group survival. Even 
so, it deserves investigation without prejudice that its empirical form will 
be restricted to manifestations fitting the dictionary definition of morality. In 
particular, it may include further features, such as higher group integration, 
cultural consistency, and the social conscience of a group regarding its members. 

'The definition of the morality dimension in factor form makes possible 
a weighted composite to measure this aspect of cultures. From an averaging 
of the several studies mentioned, with particular emphasis on the present one, 
we arrive at the following for use with standard scores to determine low 
morality : syphilis death rate, .6; alcoholic death rate, .4; homicide death rate, 
.4; proportion of births (illegitimate), .2; people per dwelling, .2; and 
tuberculosis death rate .1. И . 

The foregoing weights were subjectively determined on the basis of 
consistency of loading in the studies in which the variable appeared, magni- 
tude of loading, and calculation of multiple-regression coefficients on one 


" 3, 
sample (which showed, for example, that the tuberculosis death rate's 
dominately accounted for in the same variance as the 


contribution is pre c 3 : 
thus necessitating a lower weight than otherwise would 


syphilis death rate; | h 1 
be accorded to it). Variables loading at a borderline level in the present 


study have been included only if they were salient in other studies. Naturally, 
the weighting system is tentative and must be improved through further study. 

When countries were scored on the four most-salient variables of this 
factor (in their standard-score form), Brazil, Burma, Dominican Republic, 
Mexico, and Portugal were found among those low; while Denmark, Iceland, 
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Israel, and Sweden were prominent at the high end of the factor. From a 
world-reputation viewpoint, the former countries fit the suggested syndrome 
of low morale, immorality, and denial of responsibility more than the latter. 

From the primary factor correlations of Table 3, one sees this factor 
forming a cluster with Factor 2 positively and with Factors 8 and 10 
negatively, confirming some of the previously found relationships; the lack of 
cultural integration is a possible basis of the cluster. 

Factor 1, Vigorous Development vs. Lack of Exploitation of Resources, 
confirms Cattell’s factor of “Vigorous, Self-Willed Order vs. Unadapted 
Perseveration,” with a match at the .03 probability level by the s index. The 
variables loading it are many miles of railroad per capita (6), high national 
income per capita (20), large percentage of population Protestants (42), 
large UNICEF contributions per capita (46), many telephones per capita 
(21), few people per dwelling (10), high international-aid index (37), and 
little news censorship (38). While poorly established in Cattell’s study, it 
is clearly here one of the general control of environment based on a well- 
developed technology and, from the low news-censorship loading, a society not 
afraid of free exchange of ideas. The indications are that it is a result of 
something akin to a high self-assertive erg on a national level and is intrinsically 
inherent in high group effectiveness. 

Whereas Factor 1 related to technological development, Factor 2 (Under- 
developed “Backwardness” vs. Complex Social Systems) appears to relate to 
the development of “culture” in the narrower sense in the people themselves. 
Its major variables are large population increase rate (41), large proportion 
of births illegitimate (36), few students per capita (45), few government 
ministries (19), low population density (11), low international-aid index 
(37), few religious books published (43), and low physician rate (40). The 
high end of the continuum would be typified by the ignorant, backward 
people who have little contact with the government (e.g., illegitimate births 
is a function of morality and the extent to which the government legalizes 
and records marriages throughout the country). Cattell’s Factor 10 (Fastid- 
um ee isa tentative match for this factor. 
без б RR н DE in the previous factor’s title, 
pattern with the presen antok at = aS (which matches in actual 
Patriarchal Solidarity vs. Stimulati : Ar Loris ^ level), Factor 4, 
from Cattell's earlier factor with it: d cdm a basically unchanged 

п ОБ з variables being few miles of railroad per 
capita (6), low heart-disease death rate (25), low cancer death rate (23) 
low suicide death rate (13), high Percentage of government budget io: 


a 


RAYMOND В. CATTELL AND RICHARD L. GORSUCH 93 


defense (27), low Percentage of population Protestants (42), and low 
climatic efficiency (30). Here Cattell’s earlier interpretation is not only 
confirmed but also expanded. The death rates show a lack of stress, and the 
climatic-efficiency variable indicates that the countries high on this factor 
are those in which the climate discourages high standards of living. The 
factor is in keeping with Huntington’s hypotheses (17) concerning this 
variable. 

Factor 5, Size, is defined by the following variables: large area (14), 
large population (16), many government ministries (19), large UNICEF 
contribution per capita (46), many Latin American countries (47), and many 
religious books published (43). Both the match to Cattell's first factor of 
Size (at the .01 level), and the "size" interpretation of Factor 5 cannot 
be doubted. Two of the variables show interesting effects: (а) UNICEF 
contributions to feed starving children show what one might call the “Big 
Brother” effect of the modern political climate and (2) religious books are 
related to population size but science books seem, at present, not to be; thus 
showing a discrepancy between interest in science and religion on a world- 
wide view. 

Factor 6, Democracy vs. Totalitarianism, is loaded by these variables: 
many political parties (26), little use of forced labor (34), little news 
censorship (38), and- upward cost-of-living trend (32). Although there 
is not a match in any earlier study for this factor, due to a lack of markers, 
one can easily accept its reality in terms of new but obvious interpretations. 
It is at this point that we would have liked to have been able to interview 
natives in relation to morale. 

Another clear factor, matching with Cattell's factor of Buddhist-Mongolism 
—though it happens to come reversed—at the .01 level, is Factor 7: Occidental 
vs. Buddhist-Mongolism, loaded by small percentage of government budget 
for defense (27), small percentage of population Buddhist (24), large per- 
centage of population Catholic (29), small population (16), small area (14), 
and small percentage of population Mohammedan (28). The religions loading 
this factor tend not only to be mutually exclusive but also to exert their 
major influences in different parts of the world; moreover it places the 
small Western country, which appears to be relatively homogeneous, in 
opposition to the larger but also homogeneous countries of the East. There- 
fore, we reject an exclusively religious interpretation in favor of a broader 
cultural interpretation that includes the religious elements. 

A probability at the p = .05 level is found for the match between Factor 
8, Narrowness vs. Enlightened Affluence, and Cattell’s factor of “Enlightened 
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Affluence vs. Narrow Poverty,” although of course reversed in direction. The 
following variables are salient: low population density (11), large area 
(14), and low marriage rate (9). What is surprising is its smallness in the 
present study since it was one of the better determined and larger factors in 
the original work. Two hypotheses, which present evidence cannot decide 
between, can be offered for this change. The first and most probable is that 
our selection of variables eliminated many of those related to this factor. 
The second hypothesis is that since the time of the data-gathering periods 
of previous studies, which mostly used data collected before World War II, 
the variance of this factor has decreased as more countries gained the minimum 
technological development to begin providing schools, etc. The area seems ripe 
for more extensive study. 

Factor 9, Cultural Pressure vs. Direct Expression of Drives, is loaded 
by many clashes (17), random variable (50), small proportion of births 
illegitimate (36), and small percentage of population Mohammedan (28). It 
has the only major loading of a random variable (which warns one not to 
put too much faith in any single variable’s loading) but is still clearly the 
same factor Cattell found (matches at the .01 level). It seems to differ 
from Factor 4, Solidarity vs. Stimulating Atmosphere, in being cultural; 
whereas the cultural ferment in Factor 4 appears to be more dynamic. 

Factor 10, Anomie vs. Channeled Emotionality, has the following variables 
salient: high suicide death rate (13), large proportion of births illegitimate 

(36), and high divorce rate (33). It bears a striking resemblance to some 
of Durkheim’s (11) work on suicide rates in relation to divorce, etc. His 
general theory states that suicides result from “anomie” since the society 
can no longer serve its normal function of channeling the person’s emotions 
into suitable outlets and social solidarities. The extra tension produces a 
situation which often becomes intolerable to the person and results in suicide. 
Its s index match is at the .05 level with Cattell’s factor which places the 
more rational and scientific aspects of life in opposition to the more emotional 
ones. 

Factor 11, Medical Development, is another factor without a match in 
Cattell's analysis, apparently because the necessary data were not available 
there; it is clearly the result of health through proper treatment, which the 
countries with sufficient resources are able to afford. Its salient variables are 
high physician rate (40), high national income per capita (20), low tuber- 
culosis death rate (2), and low death rate (8). 

The last three factors are, in this study, too small to interpret with any 
reasonable degree of confidence. Factor 12 matches (at the .03 level) with 
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Cattell’s Emancipated Urban Rationalism vs. Unsophisticated Stability and 
Factor 13 with Bourgeois Philistinism vs. Reckless Bohemianism (.03 level). 


F. SUMMARY 


This research aimed to clarify a cultural dimension of morale and morality 
found in rough form in several earlier factor analyses of national statistics. 
Data on 48 variables for 52 countries in the early 1950s were correlated and 
factored (with three random variables) yielding 14 factors rotated to simple 
structure. 

The majority of these factors match statistically in pattern and in associated 
meaning the factors found by Cattell on data of the 1930s and 1940s; though 
two comparatively clear new factors are added, tentatively named Democracy 
vs. Totalitarianism and Medical Development. Although our new variables 
enriched the other factors, their main contribution was to the morality factor. 
Since this factor repeats its pattern at a significant level (p < .05) and 
stands in a framework of nine other factors just as the former factor did, its 
identity as a distinct factor is supported. 

The meaning of the morality factor is conceptually distinct as a component 
of morale, which is probably a second-order factor; the present evidence 
reduces its association with slum conditions or health conditions per se, and 
its absence of loading on school education suggests it is an aspect of cultural 
values passed on more through the family, the peer group, or religious organiza- 
tions. Central in it is a sense of community responsibility and superindividual 
dedication, probably not transmitted through laws and outer controls but 


through inner values. 
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SOCIAL-DESIRABILITY RATINGS OF PERSONALITY ITEMS 
BY THREE SUBCULTURAL GROUPS*1 


Department of Psychology, University of British Columbia 


CAROL JEAN Diers 


A. INTRODUCTION 


In constructing his Personal Preference Schedule, Edwards (2) attempted 
to control the tendency for subjects to respond to personality questionnaires 
in a socially desirable manner. Items in the EPPS were scaled for degree of 
social desirability and two items, each representing a different personality 
variable, were paired in terms of similar social-desirability scale values. In 
taking the schedule, the testee is forced to make a choice between paired 
items. 

Judgments of how desirable or undesirable various groups consider certain 
attitudes or behavior in others have been obtained to determine the generality 
of these ratings (3, 6). Intercorrelations of social-desirability ratings of 
EPPS items are consistently positive and significant. This has led to the 
suggestion that there is a social-desirability stereotype that persists across 
many subcultural groups. If such were the case then tests, such as the EPPS, 
may be constructed to control the social-desirability variable in personality 
testing. . . 

In the case of the EPPS, an additional examination of the intraclass cor- 
relation between matched pairs is required before one can conclude that the 
social-desirability variable is controlled for a given group. Edwards (2) 
reports an intraclass correlation of .85 for social-desirability scale values used 
in constructing the EPPS. Klett (5) reports the correlation between matched 
pairs for high-school students as .69 and Fujita (4) found an intraclass 
correlation of .76 for the Japanese-American group, 6 

Klett апі Yaukey (6) summarize social-desirability ratings of EPPS 
items made by six different groups. They indicate that even though there are 
high positive correlations on the judgments by various groups there may be 


* Received in the Editorial Office, Provincetown, Massachusetts, on April 6, 1964. 
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important systematic differences when similar items are grouped according 
to the 14 “needs” the EPPS purportedly measures. Klett and Yaukey use 
these differences in need rating to infer similar differences in the groups 
making the rating. 

This study reports analyses of judgments made by three new groups. The 
first hypothesis was that intercorrelations of judgments made by the three 
new groups with those made by the six groups reported by Klett and Yaukey 
would be positively significant. The second hypothesis was that intraclass 
correlation of item pairs for the three new groups would be positively signifi- 
cant to a degree warranting the assumption that the social-desirability variable 
would be controlled for these groups. The third hypothesis was that these 
three groups and Edwards’ original University of Washington sample would 
not differ in mean rating of the 14 EPPS needs. 


B. METHOD 


Social-desirability judgments of the 135 statements in the EPPS were 
obtained from (a) University of British Columbia students, male and female; 
(6) Hungarian immigrant university students, male?; and (c) Canadian 
juvenile delinquents, female.? 

The Canadian university sample consisted of 123 males and 103 females 
enroled in two sections of introductory psychology at the University of 
British Columbia. The age range for the 226 subjects was 17 to 36, with 
a mean age of 21.2 and a standard deviation of 3.8 for the males, and a 
mean age of 18.8 and a standard deviation of 2.1 for the females. 

'The Hungarians were 70 male students who were part of the University 
of British Columbia, Faculty of Forestry, Sopron Division. Shortly after 
the Soviet-occupation authorities employed armed force on November 4, 
1956, to suppress the October 23, 1956, Hungarian revolution, the faculty 
and students of the School of Forestry, located in Sopron, left Hungary as 
a group. After a brief stay in Austria, this group immigrated to Canada and 
resumed their studies at the University of British Columbia. The major 
portion of their university classes were taught by the original Sopron faculty 
and except for special classes in English the classes were taught in Hungarian. 
Retention of the internal group structure has no doubt retarded the assimila- 
tion of individual Hungarian students into Canadian society. The group had 


? Dean R. G. Roller of the Sopron School of Forestry was very helpful in permitting 
testing of the Hungarian students. 

3 The author is indebted to Miss D. Johnstone, Treatment Director, Girls Indus- 
trial Home for permission to test the juvenile delinquents. 
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been in Canada 14 months at the time the social-desirability scale values were 
obtained. It is worth noting that the official policy at the Sopron School of 
Forestry was to admit 70 per cent of the students from the peasant and 
worker classes and 30 per cent from the intellectual and white-collar classes. 
The immigrant group consisted of a slightly higher percentage of individuals 
from the white-collar classes than this expressed ideal (7). The age range 
of the 70 Hungarian students was from 20 to 32 years, with a mean of 22.7 
and a standard deviation of 2.6. 

The delinquent group consisted of 40 institutionalized females in the 
Girls Industrial Home, Vancouver, B. C. The most frequent charge against 
this sample was “incorrigibility,” a legal term covering most types of mis- 
demeanors, other than that of sex offenses, which was the second most- 
frequent charge. According to the Treatment Director, most of the girls 
were from the lower socioeconomic strata and about one-half were wards of 
the province or some private children’s agency. The age range of the 40 
delinquents was 13 to 18 years, with a mean of 15.7 and a standard deviation 
of 1.2. 

All subjects rated the items on a nine-point scale of social desirability 
according to instructions outlined by Edwards (3). The Canadian university 
students gave their ratings during a regular class hour, as did the Hungarians. 
Instructions and statements were administered in Magyar to the Hungarians.* 
The juvenile-delinquent ratings were obtained in an assembly called for that 
purpose. 

C. ANALYSES AND RESULTS 

Social-desirability scale values were computed by the method of successive 

intervals. Male and female ratings given by the Canadian university sample 


were first computed separately. The correlation between the two sets of 


ratings was .95. Successive-interval ratings were then obtained for the com- 


bined sample of 226. | 
The ratings obtained from the three groups were correlated with all other 
ratings reported by Klett and Yaukey,® as shown in Table Ж 
All correlations are significant at the .01 level. (Nine items from need abase- 


ment are not included because data were not available from all samples.) 


The result in the main substantiate the findings of previous investigations; 
s several subcultural groups is a common agreement 


that is, persisting acros: ups : Е 
ы = characteristics. Excluding Hungarian 


regarding socially desirable personality 


4 Thanks are due Dr. J. Huberman for translation into Magyar and administration 
of th i ing scale. Te 
5 Se cess ous ү by C. J. Klett, personal communication, 1956. 
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TABLE 1 
INTERCORRELATION OF SocIAL-DESIRABILITY SCALE VALUES FOR CANADIAN, 
HUNGARIAN, AND DELINQUENT SAMPLES WITH SIX OTHER GROUPS 


Canadian 
Group University Hungarian Delinquent 

American University 95 71 58 
Hospital 81 43 50 
High School 90 -65 .62 
Beirut 32 78 54 
Norway .80 71 .62 
Nisei 93 69 51 
Delinquent 54 47 

Hungarian 72 47 


and delinquent ratings, the Canadian university correlations range from .80 
to .95. This is comparable to previous reports. However, the other two groups 
in this study (while providing significantly positive correlations) are more 
deviant than any previously reported. The Hungarian correlations range 
from .65 to .78. All the correlations with the delinquent sample are lower 
than those thus far obtained, the range being .47 to .62. 

Matched-pair correlations indicate the degree to which the forced-choice 
items would be equated for social-desirability scale value if the EPPS were 
used on these populations. An intraclass correlation of .73 was obtained from 
the Canadian university sample. While this correlation is significantly dif- 
ferent (at p < .05) from the original matched-pair correlation of .85 ob- 
tained by Edwards (2), nevertheless a good degree of covariation is 
evident. The intraclass correlations of .38 obtained from the Hungarian 
group and that of .35 for the delinquent group do not seem adequate to 
warrant the assumption that the EPPS controls the set to respond in a 
socially desirable manner for these populations. 

Item ratings by the three new groups plus Edwards’ original sample of 
152 American university students were grouped according to the 14 needs, 
and analyses of variance were performed to compare mean need rating. In 
order to eliminate possible rating-scale bias, each of the four sets of scale 
values was transformed into normal deviates with a mean of 50 and a standard 
deviation of 10. Means and variances were thus equated on the total set of 
items, but individual items within each group were left free to vary. The 
135 items were then grouped into the 14 needs, and mean ratings on each 
of the needs were computed for each group. The analyses indicated whether 
differences in item ratings were confined to a specific need or could be ac- 
counted for in terms of random variation. 

Eleven of the 14 needs showed significant differences. Deference, succor- 
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ance, and nurturance produced nonsignificant F ratios. Mean ratings on ex- 
hibitionism, affiliation, and dominance were significantly different (at the 
:05 level). The remaining needs—achievement, order, autonomy, intraception, 
change, endurance, heterosexuality, and aggression—varied significantly in 
mean rating (at the .01 level). 

Duncan's multiple-range test (1) was employed in the 11 needs that re- 
sulted in significant F ratios to determine the specific group or groups con- 
tributing to the differences. The group means on each of the “needs” and 
significant differences in mean rating are presented in Table 2. 

'There are no indications that American and Canadian students differ 
significantly on any of the needs rated. Further, Hungarian, Canadian, and 
American students regard the needs of achievement, exhibitionism, intracep- 
tion, change, endurance, and heterosexuality in a similar manner. Hungarians 
regard the need affiliation as less desirable than do the American and Canadian 
college students and Hungarians rate the need dominance as less desirable 
than do Canadian students. The need order is rated higher by the Hungarian 
group than by any others. 

Compared to the three other groups, Canadian delinquents rate the need 
order, intraception, and endurance significantly lower in desirability. Com- 
pared to American and Canadian college students, delinquents undervalue 
the need achievement. The needs exhibitionism, autonomy, change, hetero- 
sexuality, and aggression are rated as more desirable by the delinquents than 
by the Canadian group. Of these five needs, the delinquents view the needs 
autonomy, change, and aggression as more desirable than do the American 
college students; and the needs exhibitionism, change, and heterosexuality as 


more desirable than do the Hungarians. 


D. Discussion 


It should be noted that evaluation of a trait as being socially desirable does 
not necessarily indicate a corresponding high need; for example, a group may 
rate aggression as being socially desirable, but that same group may or may 
not have high needs for aggression. The data presented merely indicate 
which traits certain groups rate as being more or less socially desirable than 
other groups. А | | 

Canadian and American college students, who show no differences in their 
need ratings, will be taken as referent points for discussing the Hungarian 

» 
and delinquent ratings. : 
Hungarians value the need order more highly than any other group. They 


also tend to evaluate the need aggression more highly than do Americans 


col 


TABLE 2 
EPPS Neep MEAN SCORES AND SOURCE OF SIGNIFICANT DIFFERENCES ON SociAL-DEsiRABILITY RATINGS 


BY AMERICANS, CANADIANS, HUNGARIANS, AND JUVENILE DELINQUENTS ы 
De- Hun- Ca- Amer- Е 
linquents garians nadians icans Differences* 2 
Need (D) (H) (C) (A) DH DC DA HC HA CA E 
Achievement 49.07 52.38 54.97 55.97 — .01 .01 = ете о 
Order 48.36 60.60 53.01 53.81 01 01 01 01 01 т 
Exhibitionism 45.61 38.71 40.85 42.09 01 .05 == pun = = © 
Autonomy 51.33 48.07 43.53 45.35 — .01 01 05 — — Q 
Affiliation 58.95 56.36 60.85 59.75 — — — .01 .05 — ü 
Intraception 48.13 53.02 52.16 51.34 01 .01 .05 — — — 
Dominance 39.02 37.85 46.02 42.20 — .05 — 01 — 5 5 
Change 60.44 51.70 53.61 54.31 01 01 01 = -— = s 
Endurance 48.78 57.97 56.58 53.88 01 01 05 = = кем т 
Heterosexuality 54.75 47.85 48.52 5142 .01 01 — = — = o 
Aggression 48.87 42.68 32.19 30.99 — .01 01 05 05 — 5 
Deference 47.34 47.21 51.49 50.98 Q 
Succorance 44.72 48.12 47.91 50.66 
Nurturance 54.53 57.51 58.34 57.30 


* Entries show differences significant at the .05 and .01 levels. 
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and Canadians. Hungarians rate the need affiliation as being less socially 
desirable than the reference groups. This group of Hungarian students could 
be described as having acted in accord with the values they espouse. During 
a time of relative disorder in their own country they attained order by moving 
to another continent. Breaking away from their homeland may have been 
made easier by their relative underrating of the need affiliation. Of course 
they could also be rating order as relatively more socially desirable and 
affiliation as relatively less socially desirable as a later justification for their 
action. 

Compared to American and Canadian college students, Canadian delinquents 
rate as more socially desirable the needs autonomy, change, and aggression. 
They also tend to rate exhibitionism and heterosexuality as more socially 
desirable than do the Canadians, but no significant differences appear between 
delinquents and Americans on these trait ratings. Delinquent mean rating of 
the needs achievement and order indicate they consider these traits less 
socially desirable than do the reference group. They tend also to undervalue 
the needs of intraception and endurance. Perhaps almost anyone who is 
forcibly institutionalized might be expected to value autonomy and change 
as socially desirable. On the other hand, the comparative low value placed on 
achievement and order corresponds to their particular deviant position in 
society. From these data it is impossible to say either (a) that the value system 
as reflected here has in part determined the delinquency or (b) that being 
in the role of institutionalized delinquent has determined the value system. 
It is possible to say here, as in the case of the Hungarian students, that the 
behavior and position of the group seem logically congruent with the need 
ratings. 

To turn to another aspect of these data, Canadian delinquents differ more 
from American and Canadian university students than do the Hungarian 
university students, despite the fact that Hungarians come from a different 
subcultural background and language family. It may be that what has been 
alled the social-desirability stereotype persists across widely different groups 
provided these groups are comparable in intelligence, education, and social 
class within the Western European cultural continuum. 

Personality tests are usually constructed, standardized, and cross validated 
h institutions. By far most groups are students. 


on groups available throug! i 1 і 
It is tempting to assume that if groups which vary as widely as Americans, 


Canadians, Norwegians, Nisei, and Lebanese students indicate a stereotype 
a Л : Hon ien 
of what is socially desirable behavior, then subgroups within one nationality 
must not differ significantly from this norm. However, farmers, loggers, 
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fishermen, and delinquents may have different ideas about what behavior is 
socially desirable. Individuals from these groups may produce statistically ab- 


terms of the social-desirability norms of their own group. Similarities in value 
judgment within social class across national lines may be greater than across 
social-class lines within a nationality, 
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THE EFFECTS OF DISCREPANT INFORMATION ON 
PERCEPTUAL PROCESSES: AN APPLICATION OF 
BALANCE THEORY*? 


Behavioral Science Project, University of California, Berkeley 


Davin VERN STIMPSON 


A. INTRODUCTION 


Balance theory, as set forth in T'he Psychology of Interpersonal Relations 
(2), is a general theory of cognitive balance that deals with the relationship 
between two people, P and O, and an impersonal object, X. Heider does not 
suggest operations by which to quantify these elemental relationships, al- 
though others (1) have been successful in doing this. The present study in- 
volved two modifications of Heider's original formulation: (а) incorporation 
of a quantitative indication of relative balance or imbalance and (5) altera- 
tion of the POX system to include three perceptual entities: an individual's 
self-perception, his perception of a group of which he is a member, and his 
perception of how the group perceives him. 

In order to qualify elements that in Heider's original model are dichot- 
omously positive or negative, they were assumed to fall on an evaluative 
continuum with the poles defined as high-positive evaluation and high-nega- 
tive evaluation. The point along this continuum at which an element was 
placed by a subject was used as an indication of his perception of the value 
of that element. By assigning an ordinal scale to the continuum, the quantita- 
tive assessment of any particular element could be obtained. 

The general notion of a strain toward balance indicates that if a balanced 
(harmonious) state does not exist, forces toward this state will arise (2, 4, 7). 
This balance hypothesis was utilized for making predictions of relationships 
among the quantitatively-assessed perceptual elements. It was assumed that a 
cognitive system made up of N elements would be balanced to the degree that 
the same numerical evaluation was assigned to all N elements and unbalanced 
to the degree that a disparity existed between the evaluations assigned to the 
N elements. A similar hypothesis, concerned with the relationship between 
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two cognitive elements, has been experimentally supported by Osgood, Suci, 
and Tannenbaum (6). 

In the present study, answers were sought to the following two questions: 
(a) What effect does the introduction of discrepant objective information into 
an individual’s cognitive system have on the related elements of the system? 
and (b) Are there consistent differences among individuals in the amounts 
of cognitive discrepancy they will tolerate? 

The above questions were translated into two formal hypotheses: Hypoth- 
esis I. If a discrepant objective element is introduced into a congruent 
subjective system, subjective elements will undergo re-evaluation in order 
to restore congruity to the system. Hypothesis II. There are relatively stable 
individual differences in the amount of incongruity that will be tolerated in 
any particular cognitive system. 


B. METHOD 


In order to test the hypotheses, we made direct comparisons of the evalua- 
tions assigned to several different objects by the same person. The semantic 
differential scale devised by Osgood, Suci, and Tannenbaum (6), which 
utilizes pairs of bipolar adjectives, was chosen to facilitate this comparison. 
Since these scales are relatively independent of the immediate conceptual 
context, they are well suited for the purposes of this study. Six pairs of ad- 
jectives were used to quantify the cognitive elements: good-bad, necessary- 
unnecessary, worthless-valuable, successful-unsuccessful, influential-uninfluen- 
tial, and important-unimportant. 


1. Subjects 


Subjects were 42 upper-division men and women students at Brigham 
Young University. They were enrolled in two sections of a class in group 
relations; 20 students were in one section, 22 were in the other. The sections 
met twice a week for two hours per meeting throughout the semester. The 
sections were conducted primarily according to group-centered training 
techniques, with no formal agenda. 

Each subject was asked to rate four groups of objects on each of the six 
scales, with a maximum possible range of seven points on each scale: himself 
as a member of the group, the group, his perception of the group's evaluation 
of him, and each individual member of the group. А total score for each of 
the evaluated objects was derived by summing across the six scales, makin 
a maximum possible evaluation of 42 and a minimum possible of lime 2 
any one attitude object. Several pairs of distractor adjectives were included 
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in the scale but not scored; the direction of scoring was randomized to con- 
trol for response set. 

Students in both sections first responded to the semantic-differential scales 
after they had met together for a total of approximately 14 hours of inter- 
action. From the responses it was determined how each person evaluated 
himself as a group member, how each person evaluated the group as a whole, 
how each person perceived the group as evaluating him, and how each person 
evaluated every other member of the group. By combining scores, the average 
evaluation given each individual by the group was also determined. 

As was indicated earlier, the cognitive system of concern in this investiga- 
tion consisted of three elements which may be diagrammed as follows: 


self 

evaluation 
perceived 
group 
evaluation 
of self 

evaluation 

of group А 


In keeping with the predictive scheme proposed, the cognitive system will 
be congruent to the degree that all elements of the system are similarly 
evaluated. It will be incongruent to the degree that there is lack of agree- 
ment among the numerical values assigned to the elements. 


2. Treatment 


Subjects were divided into three groups: (а) positive incongruent—when 
the average group evaluation of $ was higher than his evaluation of himself, 
(2) congruent—when the group evaluation of $ was approximately equal to 
Ss evaluation of himself, and (c) negative incongruent—when the group 
evaluation of 8 was lower than Ss evaluation of himself. Each subject was 
presented with a graph on which were indicated his self-evaluation and an 
average evaluation purportedly assigned to him by the group. This group 
evaluation had been changed (without S's being aware of it) to increase the 
disparity between ratings for incongruent $$ and to minimize the disparity 
for congruent Ss. Sample graphs for each of the three conditions are given 
in Figure 1. A total of 14 students fell into each of the treatment conditions 
indicated in Figure 1. 

It was assumed that presentation of these graphs would create a situation 
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FIGURE 1 
SAMPLE GRAPHS ILLUSTRATING PosirivE-INCONGRUENT FEEDBACK, NEGATIVE-INCONGRUENT 
FEEDBACK, AND CONGRUENT FEEDBACK 
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of cognitive imbalance in the incongruent conditions and a situation of cog- 
nitive balance in the congruent condition. The latter condition was used as 
a comparison group against which to test the effects of the experimental 
treatment. 

The graphs, constituting the experimental treatment, were given to the 
subjects on approximately the third meeting of each section after the members 
had responded to the set of semantic differential rating scales. Ss were given 10 
minutes to examine the graphs. Then each $ indicated on a seven-point scale 
the amount of discrepancy he perceived between his self-ratings and the 
rating assigned to him by the group, as well as the amount of accuracy 
he attributed to each of the ratings. The graphs were then handed to the 
instructor, and the subjects again responded to the set of semantic differential 


scales. 
C. RESULTS 


To determine whether the “incongruent” situation created by the ex- 
perimenter was so perceived by the subjects, a Spearman rank-difference 
correlation (8) was calculated between the amount of “incongruity” presented 
to each subject (Figure 1) and the degree to which he perceived inconguity 
to exist. The rho is .81, is significant beyond the .001 level; therefore 
it seems safe to assume that the situation defined as incongruent was in fact 
phenomenologically incongruent. 

Resolution of incongruity in a manner not covered by the experimental 
hypotheses could have occurred if a subject had rejected the accuracy of the 
objective discrepant information presented to him (3). To test this possibility 
a Wilcoxon matched-pairs signed-ranks test (8) was done on the perceived 
accuracy ratings. Results indicate no significant difference between positive 
and negative incongruent subjects in the amount of accuracy they attributed 
to the group's rating of them as compared with their self-rating. 

Data relevant to Hypothesis I were analyzed by a Lindquist Type VI 
factorial analysis of variance with pseudoreplication (5). 

The pretreatment and posttreatment evaluations constitute Factor A. The 
objects being evaluated (self, group, or perceived group rating of self) con- 
stitute Factor B. The condition of the subjects (positive incongruent, negative 
incongruent, and congruent) constitute Factor C. The results of the analysis 
are summarized in Table 1. nm . 

The significant treatment-by-subjects interaction indicates a difference 
between pretreatment and posttreatment evaluations assigned by subjects, 
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TABLE 1 
ANALYsIS OF VARIANCE OF EVALUATION SCORES 
Source SS df MS F 
Between Ss 23.557 41 
Subjects 3.32 2 1.66 3.198 
Error (b) 20.237 39 .519 
Within Ss 46.073 210 
Treatment .803 1 +803 6.42* 
Objects 16.715 2 8.357 49.1598 
Interaction 
"Treatment-by-objects .099 2 -049 1.581 
Treatment-by-subjects 5.608 2 2.804 22.43** 
Objects-by-subjects 1.743 4 438 2.576* 
Treatment-by-objects- 
by-subjects .575 4 144 4.645%% 
Error (w) 20.530 195 105 
Error, (уу) 4.882 39 125 
Errors (w) 13.235 78 17 
Errorg (w) 2.413 78 031 
Total 69.63 251 
*p<.05. 
** 5 c 0l. 


which difference depends on the particular treatment given (viz., positive 
incongruent, negative incongruent, or congruent). This significant difference 
results from a tendency on the part of positive incongruent subjects to raise 
their self-evaluations, group evaluations, and perceived group rating of self- 
evaluations; and for negative incongruent subjects to lower their evaluations. 
Evaluations of congruent subjects remained about the same. The tendency 
for the elements of a subjective system to undergo re-evaluation in order to 
establish. congruity with a (previously) discrepant objective element con- 
firms Hypothesis I. 

The significant objects-by-subjects interaction indicates that evaluations 
varied as a function not only of the object being evaluated, but also of the 
treatment given the subject. Apparently subjects re-evaluated the three 
objects selectively, depending on the kind of incongruity they experienced. 
"Table 2 shows mean posttreatment change in evaluations made by subjects in 
the three experimental conditions. 


TABLE 2 
MEAN CHANGE IN EVALUATION RESULTING FROM EXPERIMENTAL TREATMENT 
А Perceived 
Self- Evaluation group eval- 
Group evaluation of group uation of self 
Positive incongruent 5.07 3.78 4.93 
Congruent 1.00 14 244 


Negative incongruent —4.14 — 72 —3.38 


| 
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A subject’s hesitancy to lower his evaluations of the group even when 
the group rated him lower than he thought it would, but a willingness to 
raise his evaluations of the group when the group rated him higher than 
he thought it would, is an interesting phenomenon that deserves further 
investigation. 

The higher-order interaction (treatment-by-objects-by-subjects) confounds 
replication variance with condition variance; so is uninterpretable. 

Hypothesis II (namely, that there are relatively stable individual differences 
in the amount of incongruity that will be tolerated in any particular cognitive 
system) was tested by calculating rho between the total incongruity present 
in a cognitive system before and after the experimental treatment. It is 233 
significant beyond the .001 level. Straightforward acceptance of the hypothesis 
is hindered, however, by further analysis of the data. Rhos computed separately 
for positive incongruent Ss, negative incongruent Ss, and congruent Ss are 
-63, .53, and .32 respectively. This lack of equivalence will be considered in 
the discussion section. 

D. Discussion 


Our data support balance theory, extended to quantitatively-assessed per- 
ceptual elements in the cognitive system of individuals in a relatively enduring 
interpersonal situation. The fact that related elements of the cognitive system 
are re-evaluated by selective amounts to restore relative congruity when an 
incongruent objects element is introduced allows added insight into the 
dynamics of cognitive systems. The majority of investigators of the thesis of 
cognitive balance have focused on the element of the system that is most 
directly affected by the discrepant information. Our data indicate the 
importance of examining a broader spectrum of effects, including the effects 
on related elements of the system. 

Our finding that the amount of change in evaluation of the group is 
dependent on whether a subject is in the positive or negative incongruent 
condition raises question as to the generality of the finding of “congruency” 
for both positive and negative conditions as reported by Taguiri, Bruner, and 
Blake (9). The data reported by Taguiri et al., however, were based on 
person-person relationships; while our data were derived from person-group 
relationships. This difference may account for the lack of agreement between 
results.? : 

The fact that positive incongruent subjects show a higher tendency to 
Teturn the cognitive system to a state of relative congruency than negative 


—— 

? Another Possibility is that there are personality differences between the positive 
and negative incongruent subjects in our study. Since no personality data were col- 
lected, this hypothesis must remain a speculation. 
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incongruent subjects is probably an artifact of the reluctance on the part 
of negative incongruent subjects to re-evaluate the group. This reluctance 
results in a larger discrepancy between the evaluation assigned the group and 
the other elements of the system by the negative incongruent subjects than 
was the case with the positive incongruent subjects. 

Our results demonstrate (а) that balance theory is a useful predictive 
principle even in situations not saturated with demand characteristics; (5) 
that the introduction of discrepant information into a cognitive system causes 
reverberations and adjustments through an extensive part of the affected 
cognitive area; and (c) that different cognitive dynamics operate under 
conditions of positive and negative incongruence. 


E. SUMMARY 


Our purpose in this study was to test a quantitative extension of Heider's 
balance theory in an extraexperimental setting. We sought answers to two 
questions: Does the introduction of discrepant objective information into 
an individual's cognitive system have an effect on related elements of the 
system? and Are there consistent individual differences in the amount of 
cognitive discrepancy that will be tolerated? 

Heider's balance formulation was quantitatively extended by having subjects 
rate relevant attitude objects on а semantic-differential scale. Each subject 
rated himself, each member of the group, the group as a whole, and his 
perception of the group's rating of him. Two weeks later, subjects were given 
feedback in the form of individual graphs that indicated the rating they gave 
themselves and the purported (but distorted) average rating given them by 
the group. Then they rated the same attitude objects on the same scales 
again. 

Comparison of each subject’s initial ratings with his final ratings indicated 
the degree to which the feedback information was incorporated into his 


cognitive system and the degree to which perceptions were changed in order 
to achieve balance among the elements of the system. 


Results indicate that related elements of the Cognitive system were re- 
evaluated by selective amounts in order to restore relative balance to the 
system following the introduction of objective discrepant information. 

Subjects who received positive feedback (group rating of $ higher than 8% 
rating of himself) showed a greater tendency to restore the system to balance 
than subjects who received negative feedback (group rating of $ lower than 
S's rating of himself). It was impossible to discover whether this was due to 
situational variables or personality variables because the subjects in the two 
experimental conditions were self-selected. 


E 


/ 
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GROUP RESPONSES TO THREAT: PERCEPTION 
AND LEARNING UNDER STRESS*: 


Department of Psychology, Rice University 


Bnapronp B. Hupson, Јонм McDavip, AND Marian Roco BINNER 


A. INTRODUCTION 


There are many human behaviors that could be studied to better advantage 
under field conditions than in the as-if laboratory environment. Many extreme 
situations, such as those encountered in disasters cannot be duplicated in the 
laboratory, as Guetzkow (9) has pointed out; but there are some that can 
be approximated. Those that simulate life conditions are useful in the study 
of emotional behavior in the light of its dependence upon spontaneity. 

In this report, the authors are concerned with experiments in which sponta- 
neous emotional responses were elicited in groups by ambiguous environmental 
events that were perceived as threatening. As in life situations, neither the 
perceptions nor the responses were stereotyped. Preliminary studies revealed 
in fact a rich variety. Accordingly, the unique response as well as the uniformi- 
ties in responses are reviewed. Earlier studies with animals have indicated the 
generality of avoidance responses to strange or unfamiliar stimuli. These 
studies with humans, an extension of this earlier work, imply that on the 
emotional level qualitatively similar behaviors are involved. 

In discussing the responses elicited in these studies, the term anxiety has been 
employed to characterize the feeling reported by many of the subjects. 
Murray (22) has used it to designate a broader class of response to include 
apprehension and worry. In situations more extreme than the ones studied 
here other terms might be used: e.g., terror or panic. Agreement, however, 
is not high as to the meaning of this whole class of terms; for example, 
Webster's dictionary refers to panic as a sudden and overpowering fright and 
ultimately lists fear as a synonym. Irving Janis (16), Joost Meerloo (20), and 
Grinker and Speigel (8) have treated panic as behavior; while psychoanalytic 
theory tends to treat panic as a subjective feeling. It has been discussed by 
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Harry Stack Sullivan (25) and Hadley Cantril (5) as a subjective state 
tending to lead to relatively consistent and predictable modes of behavior. 
Other terms, such as fear or apprehension, might serve as well (21); how- 
ever since the stimuli to which the subjects in these experiments were exposed 
was characterized by ambiguity of instrumental origin or of meaning the term 
anxiety seems appropriate in this context—a usage not necessarily coinciding 
with the clinical concept. 

Several experiments conducted in different settings and with subjects vary- 
ing in age and experience provide the background for this report. The first 
experiment was exploratory—the experimental situation was a classroom and 
the subjects were Rice University students in a course in introductory psy- 
chology. The second experiment was conducted in the same setting during a 
regularly scheduled class, but three years later. In addition to these two 
studies, three replications conducted with small groups of adults in business 
or professions attending a two-week course in civil defense provided addi- 
tional qualitative data. 


B. METHOD 


Subjects in the main experiment were 62 undergraduate students at Rice 
University in a class at which approximately 100 were in attendance on the 
experimental day. Of these, 31 males and 31 females who were available for 
appointments and interviews during the week following, comprised the sample.” 
The basic procedure was as follows. During a regular lecture, a sequence of 
sounds was presented. These sounds, assembled on a prepared tape, were broad- 
cast from a series of loudspeakers surrounding the building in which the 
lecture was in progress. The sounds were presented over a period of approxi- 
mately 25 minutes, beginning a few moments after the class started. The first 
sounds were those of newsboys shouting “Extra” at a distance beyond distinct 
audibility. After several minutes, there were sounds of fire equipment and 
again after intervals sounds of distant aircraft. Gradually, the amount of 
aerial and ground activity developed to a point where intervals between 
events represented by the sounds of sirens and mass flights of aircraft, con- 
ventional and jet, were short. Toward the end of the 25-minute period, 
several people were seen running past the open doorway of the lecture hall, 
and at about the same time a stranger entered the room to deliver a note to 
the lecturer. The climax came at this point, accompanied by an air-raid alarm, 


2 Not included in the sample were those students who were aware that an experi- 
ment was being conducted. This group consisted of those who knew of the experiment 
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and almost simultaneously by the sounds of distant explosions and a flash of 
light outside the building. 

The buildup of events was gradual and not (until toward the end) 
intrusive. The subjects were provided no real basis for rejecting assumptions 
that the events that developed during the 25-minute period were real. If they 
accepted as real the presence of newsboys calling “Extra” at the beginning of 
the hour, it was difficult to reject the evidence of subsequent events which 
provided that initial experience with meaning. On the other hand, the course 
of events portrayed by the tape recording was such as to preserve a high degree 
of ambiguity until perhaps the last few seconds, with the sounds of distant 
explosions. The variety of sounds and the rapid development of the situation 
provided no satisfactory basis for the formulation of hypotheses that logically 
would account for all of the activity evidently taking place. 

Upon the termination of the sound sequence, an experimenter entered the 
lecture hall, advised the group that they had been participating in an experi- 
ment, and asked that they remain seated for the administration of a question- 
naire. The questionnaire contained 34 items on reactions to and interpretations 
of the experimental situation. Two days after the first questionnaire a second 
was administered. During the following days the subjects were met for per- 
sonal interviews at which time, in addition to completing specific follow-up 
questionnaires, they were encouraged to give detailed unstructured accounts of 
reactions during the experimental period. 

Observational data were recorded by four observers planted in the lecture 
hall for both the two preceding lectures and on the experimental day. Each 
recorded the behavior of selected groups of students, observing each subject in 
turn for periods of 10 seconds during the period of the experiment. A record 
was made of conversation with others, posture, and gross body-position changes. 
From observer records, considerably more “looking around” was evident on 
the experimental day, the students attempting to catch glimpses through the 
high windows of the aerial activity evidently in progress outside; there was 
more conversation, but the observers were unable to detect other qualitative 
differences in behaviors on the experimental day. They did not report 
apparent signs of anxiety or of emotional reactions, В 

In both the pilot and the main experiment, moving pictures were taken from 
a concealed position, the field of the camera encompassing: some 18 students. 
The photographic data corroborated the observers record in that the moving 
Pictures failed to reveal any individual differences systematically associated 
with measures of anxiety subsequently determined from questionnaires and 


interviews. 
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C. RESULTS AND Discussion 


The strength of the anxiety response was the major dependent variable in 
these studies, and of primary interest are the relationships that other measures 
bear to that response. On the questionnaire administered immediately after the 
termination of the experiment, each subject was asked to estimate graphically 
the development and the amount of anxiety he experienced during the course 
of the experiment. Figure 1 is a sample record of a student’s response. The 
high point on the curve was accepted as the individual’s score on self-rated 
anxiety and was one component in the measure of anxiety. 

Sufficient 


Alarm to 
Leave 


Wanted to 
Leave Class 


Very 
Disturbed 


Disturbed 


Mildly 
Anxious 


No 
Apprehension 


News- Several Mass Jet Students Message Students 
boys Planes Planes Planes Running to Rise to Raid 
Lecturer — Leave Siren 

Class 


Air 


FIGURE 1 
SaMPLE СЕАРН ОЕ SELF-RATED ANXIETY 
(The directions were as follows: “Plot the development of anxiety or apprehension 


that you experienced. . . -" Events are indicated on the base line and degree of anxiety 
is indicated on the vertical axis.) 


This score and data from six other questions? provided the basis for the 


3 Responses to the following questions provided the data from which the judged 
anxiety score for „each individual was derived: (a) Did you experience any afteref- 


ent 
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individual’s measure of anxiety. This score was determined by a panel of 
four judges, each independently sorting the subject’s records into one of six 
categories on the basis of the seven questions. Disagreement among judges on 
score assignments was discussed until agreement was achieved. Table 1 


TABLE 1 
FREQUENCIES IN ANXIETY CATEGORIES 
Judged rating Self rating 
Response magnitude Males Females Males Females 
“disturbed to the point of 
leaving the class” 3 3 2 5 
“disturbed to the point of 
wanting to leave class” 5 6 10 9 
“very disturbed” 7 12 8 5 
“disturbed” 9 9 3 10 
“mildly anxious” 5 1 5 2 
“no anxiety” 2 0 3 0 
Total 31 31 31 31 


represents the frequencies in the several categories for men and women 
and shows both the judged ratings based upon the responses to the seven 
questions listed and the self-rating, as determined from the graphs of anxiety 
levels illustrated in Figure 1. Distributions of the measures of anxiety show 
slightly less dispersion (with fewer cases in the extreme categories) than do 
the self-rating scores. 

The results are treated from two points of view. The first, focused pri- 
marily on the main experiment, concerns the quantitative relationship between 
the strength of the anxiety response and other measures employed. The second 
viewpoint concerns qualitative descriptions. It was evident from responses 
to the questionnaires and from interviews that some of the data would not 
lend themselves to quantitative treatment. The perceptual structuring of the 
situations directly or indirectly reflects the past experience of each individual; 
hence the variety of individual reactions, which constitute an important part 
of the data, are treated separately. 

'The data obtained from the main experiment were analyzed in terms of a 
set of 25 variables derived from content analysis of the interview and question- 
naire records. Since data on a substantial number of variables were categorized, 
the contingency coefficient, C, was used as a measure of association. The 


significant interrelationships observed among variables are listed in "Table 2. 
————— 

comparable to that developed during the experiment? Please describe fully. (g) 
Plot the development of anxiety or apprehension that you experienced on the chart 
below. Events are indicated on the baseline, and degrees of anxiety are indicated on 
the vertical axis. 
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TABLE 2 
SIGNIFICANT RELATIONSHIPS AMONG EXPERIMENTAL VARIABLES 


Contingency Significance Approximate 


Variables coefficient® level Pearson гс 

Anxiety (with) 

ak consistency 45 05 —.57 

Number of alternative hypotheses 47 .01 .60 

Estimated (projected) group anxiety 53 01 71 

Estimated (projected) group anxiety 63 01 720 

(estimate graphed) 

Suggestibility 31 10 41 

Tendency to “scapegoat” 36 .05 48 

Aftereffects 33 05 45 

Retention 31 10 —.42 

Retention —.74¢ 
Hypothesis consistency (with) 

Suggestibility 34 02 —.48 

Specificity of perceived threat .32 .05 45 


а Derived from chi square. 

b From value of chi square. 

© С corrected for categorization. 

4 Computed Pearson r. 

© Computed rho for special group [see text, p. 130]. 


The experimental procedure was uniformly effective in producing anxiety, 
The average strength of the response varied from group to group for reasons 
that are complex. Part of the variability reflects the type of interpretations 
or hypothesis formulated by individual subjects. These in turn are in part a 


in the frequency of interpretations of war. Interpretations, in other words, 
were often associated with Tecent experience. This process of hypothesis 
formation or interpretation of events is the first subject for discussion in the 


1. Hypothesis Formation 


With few exceptions, individuals exposed to the experimental situations 
attempted to organize the events in meaningful ways. This necessarily in- 
volved relating them directly or indirectly to past experience. This process 
involved the formulation of hypotheses or tentative interpretations in which 
war, accidents, and fire were common. As noted, the influence of current or 
recent events was frequently evident; for example, “The morning newscast’s 
discussion of peace made the possibility of war seem less likely.” Table 3 


ens 
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TABLE 3 
TYPES OF INTERPRETATION 
Anxiety War Accident Fire Experiment Other Total 
level % % % % % % 
5 50 50 0 0 0 100 
4 46 9 27 0 18 100 
3 11 47 16 0 26 100 
2 22 11 0 17 50 100 
1 0 50 0 0 50 100 


shows the per cents of the several interpretations for each level of anxiety in 
terms of the dominant hypothesis held by the subject. | 

Two measures related to hypothesis formation were employed. The first 
was an index of the stability of an hypothesis and the consistency of the 
individual in his structuring of the situation. This measure was based upon 
answers to several questions: (а) the number of hypotheses, (4) the fre- 
quency with which the subject changed his interpretation, (c) the extent to 
which he was influenced by others, and (d) the degree of consistency 
between the hypotheses and the events portrayed during the experiment. To 
convert the several responses into a single score for each subject, each 
questionnaire was rated independently by four judges on a scale ranging from 
1 to 6. Discrepancies in ratings were adjusted in discussion among the 
judges. The 62 subjects were grouped as indicated in Table 4. 


TABLE 4 
HYPOTHESIS CONSISTENCY 

Anxiety Good Moderate Inconsistent, 
level N consistency consistency disorganized 

4—5 17 0 7 10 

3 19 3 9 7 

2 18 5 10 3 

0-1 8 4 2 2 

Total 62 12 28 22 


The relationship between hypothesis consistency and anxiety is negative 
[C = .57 (p < .05)]. In other words, the greater the individual's success 
in developing a structured, internally consistent explanation of events the 
lower his level of anxiety. 

Hypothesis consistency is positively related to a second measure, perceived 
specificity of threat [C = .32 (» < .05)]. The score on this measure was 
based upon responses to three questions: (a) Did a plan or plans of action flash 
through your mind as the experiment progressed? Describe. (^) Assuming that 
this had been a real disaster, on leaving class what would you have done 
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first? where would you have gone? what specific route would you have taken? 
(c) What thoughts of the immediate or distant future flashed through your 
mind as the experiment progressed ? 

One element in the measure of hypothesis consistency was the number of 
alternative hypotheses. To obtain an estimate of its relationship to anxiety, 
it was treated separately as well. The subjects were classified into three 
groups according to the number of hypotheses reported: Group 1 (one 
hypothesis), N — 2; Group 2 (two to four hypotheses), N — 44; Group 3 
(five or more hypotheses), N = 16. A C of .47 (5 < .01) indicates that the 
fewer the number of hypotheses the lower the level of anxiety. These rela- 
tionships are interesting in the light of another set of results that pertain to 
efforts by the subject to verify hypotheses. 


2. The Search for Information 


Persistent attempts to verify hypotheses were characteristic in each replica- 
tion of the experiment. In the classroom Situation, sources of reliable informa- 
tion were of course nonexistent. There were, however, three pseudosources: 
communication with individuals in the class, observations of the lecturer by the 
class, and observations of the class as a whole. 

The tendency of subjects to accept tentatively one hypothesis as the most 
likely even though incompatible with other features of the situation implies 
that under stress any belief that accounts for part or parts of the unknown 
may be uncritically accepted. This state of mind sets the stage for the ready 
acceptance of rumors and underscores the second significant behavioral 
characteristic observed, the search for information. 


3. Communication with Individuals 


As noted earlier, on experimental days planted observers recorded an 
increase in conversation among persons seated together in the classroom. 
Figure 2 indicates the nature of exchanges between a sample of subjects 
seated together, a group within the field of view of the moving-picture camera. 
The following exchanges are typical of the interplay of opinions: "My 
neighbor said it was war and this increased my anxiety," or "My neighbor 
said it was maneuvers from Ellington Field and this calmed me somewhat." 
These ideas and opinions clearly filled a need. They aided in providing some 
structure to an ambiguous situation, serving to confirm or disconfirm existing 
hypotheses. The fact that they could be nothing more than opinions as to 
What was happening seemed not to prevent such conjectures from contributing 
to the individual's increase or decrease of tension. 


123 


AVID, AND М. R. BINNER 


B. B. HUDSON, J. MCD. 


(‘uotsuayatdde ou—g ‘aarsuayaidde 


— = 'ssv[o 9AE9| 0) payueM—p 1uorsuaqoardd 


794 DAB оор Yova urnpra suonvjonb IJL ‘Вләшеә 21njor 


„мош дои oq, 


ШЕ 
SFUL ‘*paqany 
-SIp Aaaa sem ә 


„8иүчзои pres, 


„epssny 
43pm aem рәлетәәр 
pey әм j3ugnoui I, 


„мои 30U оп„ 


,1uappoog 
ue jo зчЗпоцу,„ 


е јо Зицеу 


“yeas рәвәг 
d-uonow ay} jo рјәу ayy и 


NOMOVUGLN] Loa faag 
z 61009761 


Аүрїш—ү 'paqanisrp 3eqaouros—z *paqinisip Аләл 
pur ayy ur juapnis ayy Aq pojrodoi suoneAd1asqo 10 леш 
mpm pojeos sjuapnys әши jo орд v) 


„Зитцзои pres ay, 


„ļłaede Burt Tez 
SI ртдаом at 


зчәшшоо ON, 


и„їчәшшод ON, 


us°ON “pop zeaq, 
'SupKes 3dox ous, 


„8цтАрда pa 
734318 I  *snorxue 
ÁTouod3xo sem I, 


E —— ———31 
„рәүадом 


Атттәл sem aus, 


ирәләриом әм, 


„3ЧӘШшиоо ON,, 


4Suru3Aue aproap 
зои pmoo т, 


пен sd 


ЕЕ 
mun: 


„ƏAFSUƏY 
~aadde рәшәәѕ oy, 


„рәилаоиоә 
-ип рәшәаѕ ƏH, 


„атм рәлетә 
-әр sey erssny, 


„рлом P 
Aes jou prp ays, 


-3uo4j ит 
дочч8гәи шолд 

-зч8та uo 
доччЗтәи wor 

-ззәт uo 
aoqyZtau woaq 


“319s зподу 
:БУЛФШӘН 

uorsuauaadde 
paje2-3Tes 


aəqumu jeog 


-3uo1j ur 
aoqu8rou worz 


-3u8r3 uo 
aoqu8rau wog 
-uaddeu sem jeu 
„рәтадлом upequnisrp взәп8 оз Zupa} -33әт uo 
Aada sem ays, дәл sem oy, *Зиүҳог әдәм ƏMu aoqu8rau шолд 
иїОЧарц Taeag „зем 3T ,8norano 
JO augnoui І, зецм paaepuom т ләл вем ү, -3Tas moqy 
Eperme 
uoTsuayaadde 
рәзвл-утә5 
aaqunu 3вә5 


124 JOURNAL OF SOCIAL PSYCHOLOGY 


The readiness with which these opinions and ideas were accepted and 
responded to emotionally illustrates the ease with which rumors may be 
propagated. Suggestibility was evident in remarks, such as “The neighbor 
in front of me said she saw a strange-looking plane flying around. My 
anxiety increased,” “My neighbor saw something round in the sky through 
the window. I replied that maybe they are testing a new jet." These also 
illustrate the lowering of one's critical ability [see Cantril (4)], for in this 
situation in the absence of outside contact it was evident that no one was 
any better informed than anyone else. The intense desire for information 
was illustrated by the dead silence that fell over the class when toward the 
end of the sound sequence a person entered the room to deliver a note to the 
lecturer. For the class as a whole this was the point of maximum anxiety: 
The importance of the events was being “confirmed,” but what was being 
confirmed remained unknown. 


4. The Nominal Leader 


The second pseudosource of information was the lecturer. In the search 
for a clue as to the meaning and portent of events taking place, he became a 
focus of attention. It was evident that he was observed intensely for evidence 
of anxiety, and that his failure to show it contributed to its reduction within 
the class. “The fact that the professor was not anxious reassured me," was a 
frequent remark in the subsequent interviews. The fact that the lecturer 
clearly had no access to outside information and that his sources were no 
better than those of the subjects themselves did not interfere with his evident 
composure serving a source of reassurance, another example of a lowering of 
the individual's critical ability. 

Suggestive, although not statistically significant, is the degree of dependence 
upon leadership as a function of anxiety. Those Ss rated highest on anxiety 
(Group I) tended to rank “leadership” of first importance as a means of 
preventing panic in the sort of situation to which they had just been exposed 
(50 per cent). To be given information was listed second by this group 
(33 per cent). Among subjects rated second highest on anxiety (Group II) 
the order of importance was reversed, information being considered as most 
important and leadership second (55 per cent ws. 18 per cent). Those rated 
lowest on anxiety ranked other means, such as “To have a friend to rely 
upon,” or “Each person having a job to do” (65 per cent), leadership drop- 
ping to six per cent. 

The reliance, however, upon а person in authority for reassurance is an 
interesting illustration of the development within a group under stress of 
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attitudes of dependence upon a leader and of the leader’s potential power in 
such an unstructured situation. One § remarked that when the lecturer left 
the room, “I felt that our leader had gone—an unpleasant feeling.” 

The fact that there were few instances of overt behavior as long as the 
lecturer remained in the room reflected in part the group’s deference to him. 
There were a few exceptions. One student in a replication of the experiment 
left the lecture hall and, when questioned as to where he was going, sheepishly 
stated a desire to see if he could spot his brother’s plane flying from the nearby 
air-force base. Another subject jumped for the shelter under a table at the 
moment of the flash outside the building, reflecting his interpretation of war 


and an already formulated plan of action. 
5. Projection 


The third source from which the subjects attempted to secure information 
as to the meaning or portent of events was the class as a whole. Here an 
interesting phenomenon routinely appeared, two different questions yielding 
essentially the same results. The first question, “Did you notice any uneasiness 
in the class as a whole during the experiment?” required responses in terms of 
“much,” “some,” or “none.” A marked tendency was evident for the subject’s 
observation of group apprehension to increase with the level of his own 
anxiety. The term projection has been used to designate this process, meaning 
(in other words) the perception of others as experiencing feelings and making 
interpretations similar to one's own. The association between estimated group 
anxiety and the measure of anxiety is .53 (p < .01). 

The subjects were required to estimate graphically the development of 
anxiety over time in the class as a whole as they had for themselves [see 
Figure 1]. The correlation of this graphic measure of group anxiety with the 
graphic estimate of self-anxiety yields a Pearson r of .72, and with the scale 
measure of anxiety a C of .63 (5 < .01). In other words, in terms of both 
measures of group anxiety, those subjects who were apprehensive tended to 
believe that they observed uneasiness within the class as a whole. Those who 
were not anxious perceived it less. 

Consistent with these results are the facts that neither the nonanxious 
lecturer nor planted observers within the class during the experiment were 
able to detect evidence of anxiety. Further, careful study of the moving- 
picture record provided no clues that differentiated the most anxious from 
the least anxious subject. Since the individual subject was able to exchange 
opinions only with his immediate neighbors his observations were to some 
extent at least a projection of his own feelings. This interpretation is supported 
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by the subjects’ reports of what they had perceived as signs of anxiety within 
the class. For example, the hilarious amusement of a small group of students 
who had seen the moving-picture camera and were aware that they were the 
objects of an experiment did not interfere with the development of anxiety 
in other persons seated elsewhere in the lecture room. Instead, their uproarious 
behavior was perceived as evidence of their anxiety. 

Humor in the form of laughing, joking, or jesting about events occurring 
was reported by a number of subjects as evidence of the disturbed state of 
those displaying such behavior. In response to the question, "What were the 
signs of apprehension you observed in the class as a whole?" 42 of the 57 
subjects who answered this question reported either restlessness (18 subjects), 
humor (13 subjects), or curiosity (11 subjects). Other answers were suspense 
and looking around. In other words, there is a wide range of behaviors which 
under stress can be accepted as confirming one’s anxiety. 

The average level of anxiety perceived in the class as a whole was higher 
than the average level of anxiety reported by the individual. This was evident 
from measures derived from the graphs drawn to represent the course of 
development of anxiety in self and that perceived for the class as a whole. 
It was also evident from other measures. This may be a symptom of an 
unwillingness in the individual to admit to the full extent of his anxiety or 
it may reflect a common hypothesis, *Maybe they know something that I 
don’t.” In any case this projection of one’s own anxiety may be significant in 
group situations, for in perceiving anxiety in others one finds confirmation or 
overconfirmation for one’s own undefined fears; thus tending to set up a 
cyclical reaction partially self-sustaining and possibly to some extent self- 
amplifying. In fact, this process may be a factor underlying the speed with 
which some panic reactions develop in groups. Furthermore, the fact that 
none of the methods of direct observation in either of the Rice studies revealed 
overt manifestations of anxiety within the class as a whole suggests another 
possible reason why panic in certain circumstances may break rapidly. Inhibi- 
tion of emotional expression in our culture is socially approved. However, 
when a percentage of a group breaks over into overt activity, social sanctions 
are lowered, with the result that those who had been inhibiting expressions 
are rapidly recruited by the reacting group. 


6. The Nature of the Threat 


From the foregoing it is evident that the inability to organize events in 
one’s environment perceptually in a stable and meaningful way is a factor 
underlying anxiety. Response to the strange or unfamiliar has been considered 
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by Hebb (11), in his observation of the reactions of chimpanzees to strangers 
and strange objects; and by Hudson (13), in studies of rat behavior where 
the typical rodent defense reaction of covering a strange object with shavings 
is evoked. Response to the unfamiliar has been discussed by Bruner (2), in 
his consideration of perceptual learning; and by Cantril (5), in terms of the 
organism’s attempt to come to terms with its environment: i.e., to achieve 
clarity. 

In this experimental situation the combination of events carried the 
implication that something important was happening. The newsboys on the 
campus, a rare occurrence, set the stage for this interpretation. However 
the major hypothesis of many of those who were disturbed did not explicitly 
incorporate this feature, "something important." The hypothesis that students 
at the neighboring university were advertising their “Fiesta Day” did not 
involve it, nor did the hypothesis of fire or accident really qualify. Further, 
none of these would have been announced by a special edition of the news- 
paper. The absence of logic in interpretation fostered exploration and search 
for better alternatives, a trial and check or hypothesis type of behavior, 

It was also evident that threat per se was not necessarily involved at the 
conscious level. Although war-related hypotheses were most frequent among 
those who were most anxious, the hypothesis of fire or accident (neither of 
which directly involved threat) were often expressed by those in the high- 
anxiety categories. 

It is an interesting question as to whether the gradual increase in average 
level of anxiety that continued to the end of this experiment was due (a) to 
repeated failures of hypotheses to remove the ambiguity, reducing events 
perceived from the strange to the familiar; (4) to the steady addition of new 
elements in the sequence of events; or (c) to both. Adaptation to the strange 
and novel may occur quite rapidly (13). The alerting process, a mechanism 
of a most basic sort, has served its biological purpose when it provides the 
organism with the stimulus to explore, to identify the strange, or to determine 
its implications. If there ensues a period of steady state after this initial alert, 
if there are no changes in the perceived character of events, the adaptation 
Process may set in whether or not the stimulus has been recognized or organ- 
ized cognitively into something meaningful. During the civil war in Spain, 
Populations responded at first to the bombing of urban areas with great 
anxiety, taking refuge in available shelters; but with the repetition of aerial 
attacks they came outside to watch. Two men en route to England from 
the United States during World War II were among those rescued from a 
torpedoed vessel. They reported intense anxiety in response to the emergency. 
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Within hours the ship that had rescued them was torpedoed. Their reaction 
to the second experience, though it was equally fraught with danger, was 
reported as quite different. In this instance, they presumably had not only 
partially adapted to this type of situation but, by virtue of the first torpedoing, 
had resolved some of the ambiguities of action to be taken on the second 
occasion. 

The hypotheses formulated during the period of uncertainty may be 
associated with a physical exploration of the stimulus or with checks on the 
logic or internal consistency of interpretations. These activities are highly 
motivated. It is as if to be without an organized view of the environment is 
intolerable; however the response needs to be understood in terms of complex 
intellectual process only insofar as recognition requires such processes. The 
rustle of leaves or the snap of twigs will alert an animal in the woods until 
the ambiguity of its origin is resolved, but the animal does not respond to 
signs of more complex events that alert a human adult. Similarly, children 
accept without anxiety events that are novel in terms of their experience for 
their failure to perceive the ambiguities involved, whether they be in terms 
of cause, implication, or outcome (18). 

It is interesting, and it needs to be stressed, that it is not necessarily a 
threat component contained within an hypothesis that arouses anxiety. Hy- 
potheses may even be such that if correct they would remove the element of 
threat or of danger which at first would appear to be the origin of anxiety. 
The hypothesis that “It is all an experiment” is an example, but that such an 
hypothesis by itself is not sufficient (unless confirmed) is evident in the 
persisting anxiety among those who held it. 

Bruner (2) cites Bogoras, who reports that a Chukchee woman almost 
fainted on finding that he had brought a strange box into her house, objecting 
on the grounds of its strong and malevolent odor. However, instead of infer- 
ring that she behaved so because her hypothesis was that strange things smell 
bad, in terms of these data her emotional response reflected the absence of a 
perceptual organization to give meaning to this unfamiliar object with its 


strange odor. 
7. The Associative Process 


Hypothesis formation, a more primitive process than hypothesis behavior 
described by Kreshevesky (19), resembling in some respect that examined by 
Bruner (2), appears as a high-speed associative or "scanning" process for 
which the rules of frequency and recency do not seem to apply. For example, 
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in a series of experiments with small groups, in which ambiguous sounds were 
presented under conditions which appeared to represent real events in the 
environment, as many as 10 hypotheses have been recalled by subjects after 
exposure to ambiguous sound stimuli of 15-second to 30-second durations. In 
one instance, the stimulus was a recording of the sound of horses trotting on 
pavement, although never identified as such. The sounds came from a 
corner of a large laboratory which was empty except for two subjects present 
ostensibly for the purpose of grading papers. At one point one said to the 
other with overtones of alarm, “What is that?" to which the other responded, 
“Maybe we'd better get out of here.” 

The series of hypotheses rapidly formulated, checked, and rejected by one 
subject was interesting, reflecting associated past experiences of both a general 
and a specific nature. The generalized ones were “I thought of carpenters 
hammering on the floor. Then I thought of students coming down the stairs. 
At some point the idea of repair work on the ventilating system occurred to 
me and then I thought of the architects, but the sounds were coming from the 
wrong direction." One specific hypothesis was significant. This subject recalled 
a momentary scene in a moving picture attended some five years previously 
when she was 12 or 13 years of age—a locomotive was emerging from a 
tunnel. It suggests that some element in this experimental situation was 
common to this earlier flash experience in the moving-picture theater. 

To account for this specific association, since it was not supported by 
recency or frequency, and it is questionable whether vividness per se was 
sufficient to account for its recall, it would appear necessary to postulate a 
one-trial exposure and the principle of identical elements (26) in which 
specific components in the immediate present initiate momentary associative 
elaborations starting with an element or elements common in the earlier 
experience in the theater. The specificity of this type of association should vary 
inversely with the frequency with which the identical elements have been 
experienced in the intervening period. Among adults, one would not expect 
a high frequency of such specific hypotheses, but among children for whom 
there may be many stimuli that still have a one-trial association, specifically 
elaborated hypotheses of the type described should be more frequent. 

It will be recalled that in these experiments with large groups the more 
specific the hypothesis the lower the level of anxiety. Children in their 
spontaneous efforts to organize events may not only formulate more specific 
hypotheses, one-trial learning based; but in evaluating them accept them less 
critically for the lack of other incongruent rejection-forcing experiences. 
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8. Aftereffects 


It had been observed in the exploratory study that for periods in some cases 
for several days, after the experiment, the sounds of planes or fire trucks 
would partially reinstate emotional feelings experienced on the experimental 
day. Therefore, following the main experiment the subjects were questioned 
during interviews as to the associations evoked during subsequent days. Of 
the 61 subjects answering these questions, 43 indicated the experience of such 
aftereffects; 18 did not. "I laughed with relief when I heard planes or sirens 
for several days after the experiment." “Whenever I heard an airplane the 
rest of the day I felt a frightened feeling." 

'The experience or nonexperience of these aftereffects is related to the 
strength of anxiety response [C = .34 (p <.05)]. Since there was a spurious 
element in this relationship, the anxiety measure being based in part upon 
evidence of such aftereffects, the latter is also correlated with the individual's 
uncorrected graphic self-rated anxiety, yielding a C of .33 (р < .05). 

'The immediate aftereffects reported on learning that it had been an experi- 
ment were varied. А number reported an unpleasant letdown ; others, feelings 
of both relief and disappointment: "I was really wanting something to 
happen." 

Another kind of aftereffect was frequently evident: "I felt let down and 
peeved,”—an expression of hostility toward persons associated with the 
experiment. It was sufficiently frequent (11 of 62 subjects reporting this 
feeling) that its relationship to the anxiety measure was examined. Dichoto- 
mizing in terms of presence or absence of this "scapegoating tendency" and 
relating it to anxiety yielded a C of .36 (p < .05). 


9. Efficiency of Recall 


With the availability of a moving-picture record of a small segment of the 
class, a sample of 10 subjects clearly and constantly within the field of view 
of the camera was studied in detail and a relatively complete record made of 
the behaviors of each subject during the experimental period. From these data, 
checklists of overt behavioral acts were prepared.* These items were then 
buried within a list of irrelevant items and presented to each subject for 
recognition of his own overt behaviors. The scores, in terms of percentage 
of items recognized, for the 10 subjects were rank ordered and correlated with 
the anxiety scores, yielding a rho of —.74 (5 < .01). Supporting these 


4 This special analysis was done by Jan Price and Judy Garland at Rice Uni- 
versity. 
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results, the subjects in the class as a whole were also questioned as to whether 
they were aware of memory lapses during the experiment. Based upon these 
verbal reports, a C of .31 (2 < .10) was obtained. 

It is evident that under stress the field of observation narrows, reducing self- 
awareness to varying extents. In terms of the verbal reports, the subjects 
became so involved in the process of attempting to interpret the situation 
that there were numerous instances of feeling after the experiment that there 
were time gaps, a lack of complete continuity in the recall of events. These 
results perhaps can be related also to Ranson’s suggestion (24) that “Panic 
+ . . involves temporary major disorganization of thinking and control of fear, 
Consciousness is usually clouded.” 


10. Other Behaviors 


This type of situation leads to behaviors that, though they do not permit 
statistical treatment, are interesting in demonstrating the range of response 
patterns that may emerge under stress. For example, disassociated physiological 
reactions have been reported. One subject stated in the interview that he had 
not been aware of being disturbed or anxious during the experiment, but he 
admitted to feelings of hostility toward those responsible, With probing it was 
also brought out that he had perspired noticeably more than usual during 
the period of exposure to the sounds. Another subject, somewhat puzzled by 
her failure to experience the anxiety that she had detected in her neighbors, 
recalled that her hands had become very moist, something she had not noticed 
until she started to fill out the questionnaire. 

A question, “What did you do or say to yourself to reduce your appre- 
hension?” called forth a variety of replies. “I started talking to my neighbor 
to forget about what I thought was happening.” Another, “We are all in the 
same boat”; a girl, “I said a short prayer and after that I did not worry 
anymore" ; a boy, reporting the statement of a neighbor, said, “Glad I went to 
church." 

Humor as noted, was often evident: "I shot the bull or, when the 
lecturer left the room, “Maybe the professor has been drafted.” or "Next 
comes the atomic bomb,” and “What’s that? It sounds like space ships." 


11. Ambiguity in Relation to Extreme Behaviors 


In discussing panic behavior Caldwell, Ranson, and Sachs (3) list at least 
six factors that contribute to its arousal. Among them are (a) tension and 
insecurity, (2) imitative behavior, (c) suggestibility, (d) rumot, (e) fear of 
the unknown, and (f) lack of preparation. In terms of these studies, insecurity 
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and tension would appear as aspects of the anxiety response; and responsive- 
ness to rumor, a symptom of the need for organization (the lack of which 
underlies anxiety). Equivalent is suggestibility and similarly symptomatic 
is the wide range of perceptual evidence that was accepted as confirming the 
existence of group anxiety. АП are aspects of the syndrome evident in this type 
of situation. 

As we have seen, the initiating stimulus complex is instability or ambiguity 
in one's environment. It may be a function of one's inability to identify 
objects or events, of inability to determine their meaning or cause, or to predict 
the outcome, or of inability to organize or plan the course of action to be 
taken. All of these imply the absence of relevant past experience to support an 
organized perception. Thus, among the subjects in these experiments in the 
high-anxiety categories, there were some who had formulated clear hypotheses 
concerning what was happening; but who had not evolved plans for action. 
For them the ambiguity was not so much in the meaning of events in the 
immediate situation as in the ambiguity of outcome or of action to be taken 
(15) as was shown by one subject who, having formulated a firm hypothesis 
that the events meant war, was in tears over the uncertain fate of her fiancé 
who was a member of the Reserve Corps. [See, also, Janis (17).] 

The importance of having patterns of behavior available to serve as a 
stabilizing influence is illustrated by numerous examples. The emphasis upon 
training for emergencies, fire drill in schools, and aboard ships is in recognition 
of this principle. It has been reported that panic among troops subject to 
strafing by enemy aircraft during World War II was effectively reduced by 
instructing them to fire at the attacking planes with their rifles or sidearms— 
in spite of the fact that the probabilities were not large that such fire would 
be effective. It is a question as to whether this anxiety reduction was due to the 
action itself or was a function of the cognitive organization provided by 
knowledge of action to be taken in the situation. Troops in panic flight are 
taking action; yet the anxiety, according to anecdotal accounts, is not 
reduced—rather it appears to be enhanced. 

It would appear that any stability in the present environment and one's 
place in it, whether it be narrow to include only a limited area of activity or 
broad to include a clear perception of the total situation, aids in reducing 
anxiety. Children evacuated from London during the war, for a time at the 
cost of breaking up families, clung desperately to their dolls or teddybears 
as the one structured element in their otherwise strange environments [Vp 

No doubt the emphasis upon training for meeting emergencies is correct, 
but possibly for reasons other than to develop behavior patterns per se. The 
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training may provide some stability in situations that otherwise would be 
perceptually or cognitively unstructured. 


12. The Problem-Solving Process 


Recounting the course of events in these experiments, the first response 
characteristically made was curiosity. The stimulus was the newsboys calling 
“Extra.” As new elements were added to the situation, curiosity changed to 
anxiety. We can presume that both the curiosity and anxiety initiate the 
exploratory or problem-solving activity which for the most part took the form 
of covert hypothesis formation. Finally, both tension and activity ceased when 
the problem was solved; that is, when recognition occurred, which for many 
was delayed until they were told that it had been an experiment. 


13. The Issue of Age and the Response to the Unfamiliar 


The first question that arises concerning this sequence of events is the 
curiosity-anxiety response pattern. Its origin could be, as Hebb (12) has 
proposed, neither innate nor a function of learning; but nevertheless dependent 
upon both (6). On the other hand, although young animals do not display 
the signs of anxiety to those unfamiliar stimuli that alert an adult, their 
failure to respond may reflect the fact that by virtue of their limited experience 
a multitude of stimuli are novel, with the result that they are in a constant 
state of generalized adaptation (13, 14). This is a second possibility. Another, 
previously mentioned, concerns hypothesis specificity as a result of previous 
one-trial experiences and less rigorous criteria for recognition in the young 
organism. And finally a fourth relates to the fact that adult animals will 
respond with fear to an unfamiliar object if it is presented against the back- 
ground of a familiar environment; but will be unresponsive to the same 
object in a less familiar setting, the situation which initially exists for the 
very young animal (14). This phenomenon has also been evident in these 
studies. In any case, with age the environment of the young becomes familiar 
and stabilized, so that novel additions to their surroundings are responded 
to with the arousal of the biologically important alerting mechanism of 
curiosity anxiety with exploratory behavior as a concomitant directed at 
identification. 

The anxiety evident in isolation experiments discussed by Haggard (10) 
and the common fear of the dark have interesting parallels to the phenomena 
described here. Absent in isolation situations is stimulation that has connota- 
tions of threat. This, as we have seen, was not a necessary condition for the 
arousal of anxiety in these experiments. The reduction in sensory input during 
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isolation appears to allow the development of unorganized associative processes 
accompanied by anxiety, an observation that parallels the evidence described 
here that confusion of hypotheses is significantly associated with anxiety, 
Parallel also is the phenomenon of adaptation that occurs to both isolation 
and the unfamiliar. 


14. The Issue of Reinforcement 


The second question in this sequence concerns hypothesis formation, the 
covert exploratory behavior. These represented learned solutions to a problem, 
albeit for the most part erroneous in these situations. Here, as noted, explora- 
tion was initiated at the time of failure to identify events or their meanings. 
Referring to the underlying neurological activity it may be, as Hebb has 
proposed, a function of disruptions of phase sequences. This effort at recogni- 
tion (to achieve closure) we infer is unlearned; and the tension reduction 
(the elimination of the nagging urge to identify) instead of serving as rein- 
forcement for successful associations (that is, for learning) is simply another 
aspect at the physiological level of this closure. In other words, a stimulus that 
registers arouses associative processes, intrinsic parts of the perceptual process 
of recognition. Failure of these associative processes enhances these activities 
that we have identified as curiosity or anxiety, the “biogenic drives to know” 
(23). 

These problem-solving activities are primitive and biologically important 
for survival. They resemble in one respect some of the problem-solving activi- 
ties of the laboratory involving ambiguous stimuli: i.e., both require the same 
kinds of exploratory or hypothesis-forming behavior. They differ in that the 
tension compelling the effort may be in part or entirely of social origin in the 
socially imposed task, extrinsic to the stimulus; whereas in the primitive type 
of problem situation with which these studies were concerned the tension is a 
function of the ambiguity itself, the failure at recognition. 

As the foregoing implies, and for the types of situations discussed, the ten- 
sion reduction specifically associated with recognition or closure does not affect 
the learning per se. The subjects in these experiments later recalled both their 
discarded erroneous nontension reducing hypotheses as well as their few 
correct ones. In other words, although these problem-solving behaviors, overt 
or in the form of hypothesis formation, were a function of tensions, the learn- 
ing that occurred was not associated with its reduction. It is the distinction 
between performance and learning where the latter is conceived as cognitive 
and not necessarily a function of reinforcement (27). 

It is difficult to overestimate the importance of this kind of behavior (1). As 
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noted, we tend to identify its milder forms as reflecting curiosity and the as- 
sociated tensions as exciting. It can be pleasurable, in other words, as well 
as disturbing (12, 28). The number of situations that depend at least in part 
for their interest and appeal upon the element of uncertainty is almost un- 
limited. The search for the novel, the exciting, the situations of uncertain 
outcome (although often combined with other sources of satisfaction) is the 
basis of most of our recreation. Games, travel, gambling, and in literature, 
the detective story and the novel derive their stimulus value in part or largely 
from this source. Advertisers early achieved 100 per cent uncertainty with 
the expression “Even your best friend won't tell you" and they have gone on 
from there. The searching out of situations that arouse these pleasant forms 
of mild excitement would appear to be a major preoccupation of many, and 
the number of industries to provide the varieties of entertainment that have 


this common characteristic is growing. 
D. SUMMARY 


This paper describes studies with human subjects which represent exten- 
sions of earlier work on the emotional behavior of animals in response to the 
strange or unfamiliar. The methods employed were in principle similar. 
Subjects in their otherwise normal environments were exposed to sounds 
which were ambiguous in terms of their instrumental origins or of their 
meaning and implications. As with animals, ambiguities in the environment 
have uniformly aroused responses in humans that lie along what appears to be 
a curiosity-anxiety continuum. 

In one series of studies intermittent sounds of fire equipment and aircraft 
were presented to captive audiences over 25-minute periods. The data obtained 
were derived from moving-picture and observer records made during the 
experiment and from interviews and questionnaires following. The common 
interpretations made in these instances were war, threat of war, fire, and 
accident. Anxiety was typical, a response which was correlated with a num- 
ber of other measures: with perception (projection) of anxiety in the group 
as a whole (р < .01), with suggestibility ( < .10), with “scapegoating” 
(p < .05), and with failure in retention (Pp < .01). | 

А covert pattern of responses that always occurs оп exposure to ambiguous 
events is the formulation of hypotheses in attempts at recognition. These are 
rapid associative processes often involving remote and seemingly irrelevant 
experiences. Anxiety was correlated with their degree of confusion (p < .01 ), 
but zot necessarily with the seriousness of the inferred threat. In the experi- 
ments cited, different methods were employed in efforts to relieve anxiety: 
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e.g., prayer and humor. Mild anxiety was aroused in some subjects by sounds 
of fire trucks and aircraft for several days after the experiments, responses 
which were related to the amount of anxiety aroused during the experiment 
@ « 205). 

The learning associated with response to the strange is discussed from the 
point of view of traditional drive-reduction reinforcement-learning theory. 

Methodologically this approach, in using ambiguous stimuli as the inde- 
pendent variable, might appear as a distant cousin of the projective test. This 
is not the case. It differs in the first place in that the subject's focus of effort 
is upon resolving the ambiguities through establishing stimulus identity. Al- 
though both involve associative processes, in these studies the associations are 
stimulus controlled and relate primarily to the subject’s objective, environ- 
mental past, rather than to underlying motivational or subjective components. 
It also differs from the projective test in that the ambiguous stimulus is in 
fact the independent variable, and the response itself is the focus of interest. 
In the projective test on the other hand, it is the reverse: The independent 
variable is the unknown, to be inferred from the response; and the ambiguous 
stimulus patterns (e.g., the TAT card), a pseudoindependent variable, serves 
primarily as the medium or the occasion for the indirect expression of underly- 
ing motivational dispositions, 
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SOME CORRELATES OF THE DON'T-KNOW RESPONSE 
IN OPINION QUESTIONNAIRES* 1 


Center for Research on Social Behavior, University of Delaware 


Rosert C. ZILLER AND BanBanA H. Lone 


A. INTRODUCTION 


'The opinion-rendering process may be viewed as a special case of decision 
making; a judgment is required under conditions of information limitations 
and precluding information search. Given these information restrictions, at 
least two components of the opinion process are suggested : the extent to which 
the respondent weighs the adequacy of his existing information resources as 
reflected in (a) the decision whether or not to withhold judgment or (b) the 
degree to which the respondent qualifies the judgment rendered. Operationally, 
the decision maker may or may not (a) choose the don't-know alternative 
in view of the limitations of information at his immediate disposal or, (b) if 
a judgment is made, the respondent may or may not prefer to employ a six- 
alternative opinion scale to a three-alternative scale (8). This paper is con- 
cerned with the first of these components. 

In the area of decision making, a process comparable to this first component 
of the opinion process has recently undergone extensive analysis under the 
subject heading of information search (5, 6). The usual design permits the 
subject to gather additional information, usually at a given Cost, before sub- 
mitting a final decision. In the opinion process, however, it is usually assumed 
that the judgment must be submitted on the basis of existing information 
resources. Moreover, the possibility of the indefinite deferment of judgment 
is rarely an alternative in decision making situations. 

А number of studies have been reported (2, 3, 4, 9, 11) which are relative 
to withholding judgment. Usually, however, the measures of withholding 
judgment involve attitude items (2, 9) or items from a personality question- 
naire (4) ; and even though an I-don't-know alternative or a “?” alternative 
information limitations are not a central consideration, Responses 


is included, r a | | 
those indicating the withholding of judgment are implicitly 


other than 
requested, 
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cation requires more general content. 

Withholding judgment under conditions of information inadequacy was 
assumed to be a component of dogmatism, However, because of the myriad 
of meanings associated with the term “dogmatism,” this invitingly convenient 


The two studies reported here were designed to analyze the relationship 
between Rokeach’s Dogmatism Scale to withholding judgment and the 
relationship between withholding judgment and the respondent’s age, sex, 
education, and intelligence, 


В. Srupy I 


In the previously described study by Ziller ez al. (14), it was found that 
professionals as opposed to trainees in clinical psychology used the don't-know 
response less frequently in rendering professional judgments under conditions 
of extremely limited information and when a comparison was implied between 
the performance of Professionals and trainees, It was assumed that under 
conditions permitting invidious comparisons between the decision-making be- 
havior of professionals and trainees, the decision to defer judgment because 
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the don’t-know response may suggest one or more of a variety of personal 
shortcomings. In this first study, the generalizability of the earlier findings 
to the conditions of age as a status variable was tested. 


1. Procedure 


The subjects were 120 male and female students attending the summer 
session at the School of Education at the University of Delaware. Both 
graduate and undergraduate students were included in the sample of subjects 
whose ages ranged from 19 to 59. 

The opinion questionnaire was adapted from the instrument used in a study 
by Ziller and Mangano (13). The original 16 items were retained. Examples 
of the items are “There is life on other planets,” “The world will always 
have wars,” “Modern art will, in time, take its place beside classical art.” 
The total number of don’t-know responses provided the measure of with- 
holding opinion under conditions of information inadequacy. 


2. Results 


The reliability of the withholding-judgment scale was estimated by means 
of a split-half technique. The resulting correlation coefficient corrected for 
length was 0.58 (N = 120). 

The withholding-judgment scores were analyzed according to three age 
categories (under 30; 30 through 39; and 40 and above) and according to 
the sex of the respondent. These age groupings were designed to focus upon 
the middle age category, 30 to 39. Furthermore, earlier studies had revealed 
that this categorization system provided approximately equal numbers of 
subjects in each category with regard to this particular population (12). The 
requirements of proportionate number of subjects in each of the six cells of the 
age-by-sex analysis of variance dictated the elimination of 30 subjects through 
random procedures. The subjects retained represented the maximum number 
of subjects available in the sample under the statistical requirement of pro- 
portional number of subjects per cell. All subjects were either teachers or 
prospective teachers. 

"The results with regard to the main effects of age and the age-by-sex inter- 
action were statistically significant (5 < .02—see Tables 1 and 2). The 
youngest group (those under 30) elected the don’t-know response most fre- 
quently; whereas the middle age group (30 to 39) chose this response least 
frequently. The age-by-sex interaction was most pronounced between the 
youngest and oldest age categories. In the youngest group, the females 
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TABLE 1 
MEANS OF THE FREQUENCY OF DoN'r-KNow RESPONSES 
ACCORDING TO SEX AND AGE 


Under 30 30-39 Above 39 Total 
Sex N Mean N Mean N Mean N Mean 
Male 12 2.58 12 1.50 12 3.42 36 2.50 
Female 18 4.00 18 2.44 18 2.22 54 2.88 
Total 30 3.43 30 2.06 30 2.70 90 2.72 
TABLE 2 
ANALYSIS OF VARIANCE OF THE FREQUENCY ОЕ DoN'r-KNow Responses 
Source of Mean 
variance df square F 2 
Аре 2 14.00 4.22 .02 
Sex 1 4.00 1.20 — 
Interaction 2 13.50 4.07 02 
Error 84 3.32 


submitted the greatest number of don’t-know responses; in the oldest group, 


the males submitted the greatest number of don’t-know responses. 


3. Discussion 


The results demonstrated that the middle age group selected the least 
number of don't-know Tesponses more often than the other age groups. To 
the extent that the 30-to-39 age group of teachers may be assumed to be the 
most concerned about status or power or potential power in their education 
organizations, the results may be interpreted as supporting the initial hypothesis 
relating status defense and dogmatism. 

The interaction effects between age and sex with regard to the don't-know 
Tesponse was paradoxical. In the age group under 30, the males made fewer 
don't-know responses than females; in the age group above 39, the females 
made fewer don't-know responses than males. Insufficient information con- 
cerning the differences in the samples of subjects in the two age groups 
counsels the authors to withhold judgment. 


С. Srupy II 


The results of the first study were interpreted in terms of the hypothesis 
relating status and dogmatism; but the results were scarcely conclusive. In 
the second study, several shortcomings of the initial study were corrected ; 
the sample of subjects and the length and content tange of the opinion 
questionnaire were increased. In addition, the relationships involving with- 
holding judgment were analyzed with regard to Rokeach’s Dogmatism Scale 
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and intelligence. Finally, the variable of educational level (freshman, sopho- 
more, junior, senior) was substituted for the somewhat indistinct status 
variable of age. The sex categorization of subjects was retained in the 
analysis. 

In a recent study (1) it was hypothesized that there is a “progressive 
decline in dogmatism with advancing age from Grade 8 to 12.” The 
hypothesis was derived from Loevinger’s theory of ego development (7). She 
contended that the adolescent learns to “assume distance from oneself.” Thus, 
in early adolescence, the self-appraisal tends to be stereotyped, behavior which 
is also characteristic of dogmatism. Using an abbreviated form of Rokeach’s 
Dogmatism Scale, the hypothesis was supported. 


1. Instrument Revision 


Analysis of the withholding-judgment instrument used in Study I revealed 
that future-oriented items elicited the greatest number of don’t-know responses. 
Thus, in increasing the length of the instrument for purposes of improved 
reliability and validity, the new items were largely future oriented. More- 
over, in an effort to avoid a possible subject-matter bias, we added a number 
of items from a wide variety of subjects including the following: (а) By 1980 
the world’s food supply will come largely from the sea; (b) The world econ- 
omy would eventually be improved if all tariffs were eliminated; (c) Children 
who move frequently are not as well adjusted as those who grow up in one 
place; (4) By 1980, 90 per cent of the American population will be living in 
cities of 50,000 or more. 

A total of 38 items comprised the general opinion questionnaire used in 
Study II. The subjects were asked to circle one of the three alternatives with 
regard to each statement: (а) Agree, (b) Disagree, (c) Don't know. Again, 
the total number of don't-know responses comprised the withholding-judg- 


ment score. 
2. Validation as a Measure of Dogmatism 


Heretofore it has been assumed that withholding judgment under the 
conditions of the questionnaire already described is related to dogmatism. 
In order to test this assumption, withholding-judgment scores were related 
to scores on Rokeach's Dogmatism Scale. With regard to a sample of college 
women (№ = 72), r = —.32 (р < .01) ; with regard to a sample of summer- 
school students, largely teachers (N = 52), = —.27 (p < .05) ; and with 
regard to a sample of elementary-school teachers (N = 44), r = —47 


( < .01). 
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Although these correlation coefficients are low, they are consistent and sta- 
tistically significant. In comparison with the Rokeach Scale, which derives 
from an inchoate theory of personality, the withholding-judgment scale has the 
advantage of being operationally defined, objective, and emanating from a 
low-visibility response-set approach to personality measurement. 


3. The Relationship of Withholding Judgment to Intelligence 
and Academic Status 


In the second study proper, the subjects were 589 regular, undergraduate, 
male and female students from the departments of education and history at 
the University of Delaware. Included in the sample were 165 freshmen, 173 
sophomores, 147 juniors, and 104 seniors. 


4. Results 


In exploring the relationship of the two opinion-process measures with 
intelligence, the correlation coefficient of the verbal subtest of the SCAT and 
grade-point average with the withholding-judgment scores of the freshmen 
women sample (N = 134) were —0.14 (5 < .10) and —0.24 (p < .01) 
respectively. Thus, more capable students tended to select the don’t-know 
alternative less frequently; but the results were more highly related to the 
more visible indicator of ability: grade-point average, 

With reference to the analyses involving educational level and the sex of 
the respondent, simple analyses of variance with respect to educational level 
were calculated separately for males and females since a small and dispro- 
portionate number of males were included in the sample. The results are 
reported in Table 3. 

The results with regard to males and females were very similar but were 
statistically significant only with respect to the females (F = 4.08; df = 3, 

TABLE 3 


MEANS OF THE FREQUENCY or DoN'r-KNOW RESPONSES 
ACCORDING TO EDUCATIONAL LEVEL AND SEX 


Educational Males* Females** "Total*** 
level N Mean N Mean N Mean 
Freshman 31 10.9 134 10.1 165 10.3 
Sophomore 46 9.2 127 dr 173 9.6 
Junior 34 8.0 113 8.1 147 81 
Senior 37 10.0 67 9:5 104 9.6 
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440; р < .01). Pooling the data regarding both males and females, the 
results were also statistically significant (р < .001). 

The results describe a decreasing frequency of the use of the don’t-know 
response through the junior year in college. In the senior year, however, the 
subjects were found to use the don’t-know response more frequently again. 
Overall, however, the statistical results were largely attributed to the 
difference in the responses of juniors who differed significantly from members 


of all the other classes. 


D. Discussion 


The significant and consistently negative relationships between a with- 
holding-opinion index and Rokeach’s Dogmatism Scale support the hypothesis 
that avoidance of the don’t-know response under conditions of information 
inadequacy is an indication of dogmatism. 

The results of the studies involving professionals and trainees in clinical 
psychology, teachers in various age groups, and college students varying in 
academic status indicate that persons possessing relatively high status under 
conditions of instability or threat to the group structure tend to respond to 
an opinion questionnaire in accordance with perceived status and group- 
position expectations concerning their knowledge. More succinctly, it is 
proposed that dogmatism is a status-defense mechanism. Thus, clinical psy- 
chologists, when their responses were to be compared with trainees in clinical 
psychology, used the don’t-know response less frequently under conditions 
when “don’t know” was the most appropriate response. Teachers in the 30-to- 
39 age group as compared with younger and older teachers tended to eschew 
the don’t-know response under conditions when they presumably possessed 
insufficient information upon which to render a reliable opinion. Then, too, 
college students at the junior level tended to use the don’t-know response 
less frequently than members of the other classes under conditions similar to 
those under which the preceding results were observed. Finally, the results 
relating two academic measures of ability with the withholding-judgment 
index suggest, albeit highly speculatively, that the highest negative correla- 
tion (r = —.24, p < .01) between mental ability and dogmatism was found 
with regard to the most patent index of academic ability (assuming that grade- 
point average is more widely known and understood throughout high school 
and in later academic years than SCAT achievement scores). Here, too, then, 
it is proposed that the tenuous status with regard to school achievement is 
defended by conforming to the expectations of a superior student; that is, 
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the expectation that a superior student is knowledgeable and therefore need 
not have recourse to the don’t-know response. 

The marked rise in the don’t-know responses when comparing juniors and 
seniors is especially remarkable in view of the attenuation effects of the 
greater intelligence of seniors through selective screening processes. (It will 
be recalled that a low negative correlation was found between intelligence and 
frequency of the don’t-know response.) It is proposed that the diminishing 
frequency of the don’t-know response from the freshman through the junior 
year reflects the student’s perception of status within the college community. 
The increase in don’t-know responses in the senior year is interpreted as 
reflecting the senior student’s increasing awareness of his impending transition 
from the transient college community to the society in which he must find 
a relatively permanent position. Thus, it is proposed that the senior student 
responds to the opinion questionnaire in terms of his newcomer role with 
regard to the host society rather than in terms of his status within the 
transient donor college community. Although the senior student has high 
status by virtue of his education level within the college community, within 
the working community of which he is an imminent newcomer, a college 
education is often only a basic requirement of admission. 

In conclusion, the results lend support to the component approach to the 
study of the opinion Process and indicate caution in the interpretation of 


E. Summary 


Two experiments were reported investigating some of the correlates of the 
don't-know response in a general opinion questionnaire. The don't-know 
response or withholding judgment was proposed as basic component of the 
decision-making process involved in questionnaire-response behavior. The first 
study involved male and female teachers in three age groups: under 30, 30 to 
39, and 40 and above. The second study involved freshman, sophomore, junior, 
and senior college students of both sexes. It was found that the middle age 
group and students in their junior year used the don’t-know response least. 

Interpreting the avoidance of the don’t-know response as an indicator of 
dogmatism, it was proposed that dogmatism is a status-defense mechanism, In 
addition, the relationships are reported of withholding judgment with 
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Rokeach’s Dogmatism Scale and measures of intelligence. Implications for the 
interpretation of questionnaire data are discussed. 


12. 


13. 


14. 


REFERENCES 


ANDERSON, C. C. A developmental study of dogmatism during adolescence with 
reference to sex differences. J. Abn. & Soc. Psychol., 1962, 65, 132-135. 

DuxzrrE, M. D., UPHorr, W. H., & Aytwarp, M. The effect of lack of informa- 
tion on the undecided response in attitude surveys. J. Appl. Psychol., 1956, 40, 
150-153. 

Epwarps, A. L., & Diers, C. J. Social desirability and conflict. J. Soc. Psychol., 
1962, 58, 349-356. 

Foxpa, C. P. The nature and meaning of the Rorschach white space response. 
J. A bn. & Soc. Psychol., 1951, 46, 367-377. 

Irwin, F. W., Smitu, W. A., & MayvriELD, J. F. Tests of two theories of decision 
in an expanded judgment situation. J. Exper. Psychol., 1956, 51, 261-268. 

LawzETTA, J. T., & Kanarerr, V. T. Information cost, amount of payoff, and 
level of aspiration as determinants of information seeking in decision making. 
Behav. Sci., 1962, 7, 459-473. 

Loevincer, J. A theory of test response. In Invitational Conference on Testing 
Problems. Princeton, New Jersey: Educational Testing Service, 1958. 

Lone, B. Н. & ZiLLER, К. C. Qualification of judgment: A learned cognitive 
style. Percept. & Mot. Skills, 1964, 18, 235-238. 

Mirzett, Н. E. Ropinowitz, W., & OSTREICHER, L. M. The effects of response 
sets on the validity of Minnesota Teacher Attitude Inventory. Educ. & Psychol. 
Meas., 1956, 16, 501-515. 

Коквлсн, M. The Open and Closed Mind. New York: Basic Bks., 1960. 

RosENBERG, N. Izanp, C. E., & HoLtanver, E. P. Middle category response: Re- 
liability and relationship to personality and intelligence variables. Educ, & 
Psychol. Meas., 1955, 15, 281-290. 

ZILLER, К. C., & EXLINE, R. V. Some consequences of age heterogeneity in decision- 
making groups. Sociometry, 1958, 21, 198-211. 

ZILLER, К. С., & Maxcawo, D. Cognitive style and interpersonal perception. Un- 
published paper, University of Delaware, Newark, 1962. 

илк, К. C., SHEAR, Н., & DeCencio, D. Dogmatism—A professional response 
set. J. Clin. Psychol., 1964, 20, 299-303. 


Center for Research on Social Behavior 
University of Delaware 
Newark, Delaware 


The Journal of Social Psychology, 1965, 67, 149-162. 


PERCEPTUAL ACCENTUATION AS A PROBLEM IN 
THE PSYCHOLOGY OF JUDGMENT* 1.2 


Department оў Psychology, University of Miami 


Raymonp E. HARTLEY 


A. INTRODUCTION 


It is likely that the most widely believed hypothesis for the explanation of 
perceptual accentuation is the one that attributes the overestimation on the 
part of the subject to an effect upon his phenomenal experience; that is, he 
overestimates because the object looks larger. Yet if we look at the current 
evidence we find not a single study of accentuation in which the experimenter 
has attempted to get reasonably direct evidence that the effect is visual. But 
surely, considering the very considerable gap between the subject’s presumed 
phenomenal experience and his response, there must be ample scope for more 
plausible (though more pedestrian) explanations. 

For hints toward such alternate explanations we would do well to examine 
the experimental paradigm of the accentuation studies. It is easy to state. The 
subject is required to judge some attribute, call it “A,” of an object. Over a 
series of judgments the experimenter systematically varies some other attribute 
of the object, call it "B." Accentuation is shown when the variation in B 
has affected the judgments of A in comparison to what they would have been 
if B had been held constant. In current experimentation B, which we might 
call the “disturbing variable,” is usually the value or the interest of the object. 
This restriction is not necessary—B might as well be any variable the experi- 
menter may find of interest. 

From this standpoint it is plain that we are dealing with a problem in 
psychophysics. These are studies that are busy examining the effect of a 
disturber variable upon the judgments of another variable. The general 
strategy of such investigation should address itself to the following points: 

1. Do disturber variables affect judgments? 
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2. If so, are some types of disturber variables more effective than others? 
Perhaps some types are not effective. 

3. Having found an effective type of disturber, what is the precise state- 
ment, or law, of its effect? 

Before the first perceptual-accentuation study made its appearance, Point 1 
could have been answered in the affirmative. It has long been known that a 
disturber variable affects the judgment of some other variable. When judging 
weights by the method of constant stimuli it was customary to use several 
standard weights because the standard weight would warm toward body 
temperature through repeated handling, and it was known that difference of 
temperature would affect the judgment of weight. The older psychophysics 
was perfectly persuaded that any extraneous property that was allowed to 
vary would affect the judgments and routinely avoided these effects by requir- 
ing that the objects to be judged be identical in all respects save the one 
property that was of interest. 

The particular contribution of the perceptual-accentuation studies has been 
to introduce as disturber variables value or interest instead of the sensory 
variables the effects of which the older psychophysics was so well aware. In 
addition, this alteration implies a wish to study the disturbing effect; whereas 
the early psychophysics wished to eliminate such disturbances, not study them. 
This changed emphasis brings Points 2 and 3 to the fore. Since value, as a 
possible source of disturbance, seems to be a very different matter than a 
sensory variable, such as temperature or color or size, it immediately becomes 
a question for investigation as to whether both types of disturbers disturb. 
We can assume that the older psychophysics showed that sensory variables 
serve as disturbers, and I believe that the perceptual-accentuation studies have 
shown that social variables also serve as disturbers. If so, the weight of interest 
should now turn to Points 2 and 3. 

To approach these questions, I believe that the paradigm for an accentuation 
study should be changed from its present qualitative form to a quantitative 
form by requiring that the judgments be made in such a way as to be subject 
to psychophysical scaling. We could then get the scale values for the A vari- 
able when the disturber variable was absent, get them again (call them A’) 
when the disturber variable was present, and scale the disturber variable 
itself. The precise answer to Points 2 and 3 would be at hand when the best 
fitting form of the function 

4';=f(A;, В) [1] 
was known. Here “А” and “А”” are as specified above, "B" is the value of the 
disturber variable, and “j” is the designation of the object being judged. To 
express the function precisely would be the answer to 3, and the answer to 2 
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would follow. If, for instance, [1] had a different form when B was a sensory 
variable than it did when B was a social variable we would know at once 
that 2 was incorrectly expressed. Social variables would be seen to be operating 
differently than sensory variables. If the form of [1] was the same in both 
cases then the appropriate constants of [1] could be used as measures of 
relative efficiency. 

I argue, then, that the current situation in the field of perceptual accentua- 
tion necessitates that we turn our attention to the discovery of the form or 
forms of [1]. Let us drop our efforts to show that social variables do disturb 
and try instead to determine the manner in which the disturbance is ac- 
complished. The present study is meant to exemplify this intent. The variable 
“A” to be judged is the area of geometric figures; “А”? is to be the area when 
judged in the presence of a disturber “В”; “В” is to be judged monetary value 
associated with the figures; and the primary object is to specify the form of [1]. 


B. PROCEDURE 


The subjects used were 124 volunteers from introductory-psychology classes 
at the University of Miami. They were assigned to one of four groups, two 
experimental and two control, in rotation as they appeared for testing. 

It was decided that a convenient “A” variable for judging would be the 
area of simple geometric silhouette figures. As a convenient source these were 
selected from Birkhoff’s Aesthetic Measure (1). While perhaps of no im- 
portance as a control, the figures selected were of the same aesthetic measure. 


They are shown in Figure Ís 


м Aw 
gh я 


FIGURE 1 


THE SEVEN BIRKHOFF FIGURES үунісн WERE USED AS STIMULI 
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1. The Control Group 


The contro] group was required to Judge, for each of the 21 slides, which 
of the figures was larger. The slides were Projected by an SVE Schoolmaster 
Projector on a screen 40 cm away. An Airequipt magazine was used with the 
Projector to keep the slides in a fixed order for projection. 


One-half (N — 31) of the control subjects were required, for each of the 
21 presentations, to signify by saying "right" or “left” Which of the figures 
they thought was the larger (the greater-than group). The other half of the 
control subjects signified by "right" or “left” which of the two figures they 
thought was the smaller (the less-than group). 


The figures were redrawn, all on Separate sheets, and were all drawn to be 
the same area, An 84-presentation pair-comparison schedule was drawn up in 
accordance with the Tecommendations of Ross (4). This schedule was pre- 


— ya í 
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an average monetary value and a distribution of values over which its 
monetary value could range. This distribution of values was the same for all 
figures, so that in its 24 appearances a given figure had its average value 
assigned six times; a value two cents above its average, four times; a value 
two cents below its average, four times; a value five cents above, two times; 
a value five cents below, two times; a value nine cents above, two times; and 
a value nine cents below, two times; a value 13 cents above, once; and finally 
a value 13 cents below, once. The average values were set so: for A, 14 cents; 
for B, 16 cents; for C, 18 cents; for D, 20 cents; for E, 22 cents; for F, 24 
cents; and for G, 26 cents. The rank ordering of the objects for area was set 
80 as to correlate zero with this ordering of average monetary value. Finally, 
the foregoing distribution of values for a figure was assigned to the 24 
appearances of the figure in a random fashion. 

After being given his initial 50 cents, the subject was told that he would 
sce a series of pairs of figures and that on each pair he was to guess which was 
the more valuable. If he guessed correctly he would be given the difference 
in their values; if he guessed incorrectly the difference in their values would 
be taken away. It was his job to learn what the values of the figures were, 
and obviously the faster he did the more he could win. А 

Although the subjects tended to look for complex rules that would enable 
them always to guess correctly, it is clear that the problem was so set up that 
all they could achieve was to form an impression of the rank order of average 
values. The forming of this impression was the aim of the “experiment,” and 
after the first 10 subjects a standard procedure of continuing each subject 
through the series of 8+ as many times as necessary for him to win three 
dollars was adopted. By the time a subject had won three dollars he seemed 
to have a good but not letter-perfect knowledge of the rank order of value. 

After the subject reached the criterion of winning three dollars, he was 
told that the experiment was done satisfactorily. The projector was then 
returned to the beginning of the series of 84 and, without any exchange of 
money the subject was asked to signify (for the first 21 slides) which of the 
pairs was the more valuable. Having, hopefully, built up the concept of value 
in the subject by the preceding training the purpose was to make Observations 
on the nature of each subject's concept of that value. 

Finally, each subject judged areas—as had the control subjects. At this 
point the experimental subjects were divided into two groups as with the 
control subjects: the greater-than and the less-than groups. Thus the control 
group, divided into two, judged the area of the seven figures. The experimental 
group first experienced a concept-formation experiment designed to cause them 
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to attach monetary values to the figures. After which they judged these 
figures for area as did the control group. 


C. ANALysis or THE PooLED Groups 


As the first step of analysis, the two control groups were pooled as were 
the two experimental groups. We thus begin with three tables of proportions.? 
One table gives the proportion of judgments "greater than" for the monetary 
values of the seven stimuli, as given by the experimental group. One table 
gives the proportion of greater-than judgments for area, as given by the 
control group. And one table gives the proportion of greater-than judgments 
for area, as given by the experimental group. The less-than judgments given 
by half of each group were reversed for pooling. 

'These data were scaled by the method of pair comparisons under Case III 
of the Law of Comparative Judgment (3, Chapter 7). These scale values 
are presented in Table 1. 

TABLE 1 


SCALE VALUES FOR AREA AND MONETARY VALUES FOR THE POOLED EXPERIMENTAL 
AND CoNTROL GROUPS 


Area Area Value 
Object Experimental Control Experimental 
A 1.033 459 0 
B 1.062 987 1.323 
C 2.508 2.282 1.075 
D 2.123 1.920 1.796 
E 0 0 1.910 
F 275 .677 2.654 
G 1.614 1.375 3.471 
x2* 9.52 18.00 6.40 


* At the .01 level and nine degrees of freedom, chi square = 21.666. 


The chi-square entries in the table are the chi-square values resulting from 
the Mosteller consistency test (3, pp. 167-168). Since these do not reach the 
one per cent level of significance, which is recommended by Guilford, it 
appears that these scale values are consistent with the obtained proportions, 

The scale values for area differ considerably between the experimental and 
control groups. This, of course, is exactly what we are interested in, the first 
problem being how to test this for significance? Since the two sets of scale 
values are derived from tables of Proportions, what we need is a test that gives 


3 These tables have been deposited as Document number 8417 with the ADI Auxil- 
iary Publications Project, Photoduplication Service, Library of Congress, Washington 
25, D. C. A copy may be secured by citing the Document number and by remitting 
$1.25 for photoprints, or $1.25 for 35 mm microfilm. Make checks or money orders pay- 
able to Chief, Photoduplication Service, Library of Congress, 
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an overall test of significance between two tables of proportions. Funda- 
mentally, the Mosteller test of consistency does this. This test tests the 
significance between a set of proportions obtained from the scale values and 


the obtained proportions. 
For the case in which both sets of proportions are obtained proportions 


it appears that the degrees of freedom are: 
k(k—1 
TU d 
2 


[2] 


where £ is the number of stimuli. 
With seven stimuli the degrees of freedom as given by [2] are 20. The 


chi square between the two tables of proportions is 50.112. This, then, is 
significant beyond the .01 level. 

Now, what will be a plausible form for [1] to describe an accentuation 
effect? The following seems to express the customary view concerning per- 
ceptual accentuation: 
A’; — ed; + dB, + k [3] 
Here the 4’ values are regarded as basically linear transformations of the 4 
values plus a small increment from the judged monetary value В. Тһе quantity 
dB, the accentuation term of [3], causes the values 4’ to be inflated over 
what they would have been, and this inflation is larger the larger Bis. It was 
this type of relation that Bruner and Goodman (2), in the original coin- 
judging experiment, thought they had discovered when they plotted the 
amount of overestimation against the values of the coins and found, with the 
exception of the 50-cent piece, an increasing function of overestimation with 
= need, now, a method of determining e, 4, and k that gives a best fit 
of [3] to the whole data. Noticing that [3] is in the standard form of a 
multiple-regression equation we compute the correlations TA'A TA'B and rag 
over the seven stimuli as the sample and proceed in the usual fashion.* 


The resulting equation, for our data is: 
A’; = 1.110 Ay + .031 Bj — .059 [34] 


While [3a] is the best fit that an equation of the form of [3] can make to 
these data it may, nevertheless, not be a good fit. It appears that this matter 
can be settled as follows. If, for a given stimulus, we substitute in [3a] the 
obtained values for the 4; and B; the equation yields a predicted or "theo. 


ae 
4 This method was suggested by Quinn McNemar. 
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retical” value of A’;. These seven "theoretical" A’; values appear in the 
second column of Table 2 (the first column of Table 2 contains the obtained 
4’; values for comparison). Now we take the "theoretical" scale values derived 
from [3a] and, behaving as if they were our obtained scale values, we work 


TABLE 2 
PREDICTED SCALE VALUES FOR A’; DERIVED FROM EQUATIONS (3a), (4a), (4b) 
A’; values 
Object Original (3a) (4a) (4b) 
A 1.033 450 1.029 499 
B 1.062 1.078 1.094 1.074 
c 2.508 2.507 2.522 2.483 
D 2.123 2.128 2.092 2.089 
E 0 0 0 0 
Ё 775 775 252 737 
G 1.614 1.575 1.623 1.496 
х2* 36.61 8.89 34.90 


* At the .01 level and nine degrees of freedom, chi square = 21.666. 


the scaling procedure backwards to get a set of theoretical Proportions implied 
by these scale values. This set of proportions is then tested against the actual 
set of proportions, by Mosteller’s test. The chi square is given in Table 2 
and is well beyond the .01 level, Clearly, then, these theoretical scale values 


do not agree with the empirical proportions and [3a] cannot be regarded 
as giving a satisfactory fit to the data. 


Thus the customary concept that the amount of accentuation is proportional 
to the value of the object has f. 


ailed in the context of this experiment. The 
question now is whether a more complex function of В can be found that will 
do the job. 


Inspection of the data led to the impression that value was effective, as a 
disturber, as a function of its "extremeness" rather than the absolute value. 
Furthermore, it seemed. that the effect was not merely proportional to the 
degree of "extremeness" but increased rapidly with increasing extremeness, 
This impression was implemented as follows. 

The central tendency of the Bs was computed. Here the value of Object 
A was quite deviant; so it was dropped and the central tendency 
as the mean of the remaining six values. This mean wa 
deviate scores of each object were then figured and appear in Column 1 of 
Table 3 under the heading E*, These scores were supposed to represent 
“extremeness” of value. The antilogarithms, to base 10, of these E*, 
were then determined and appear in Table 3 under 


was taken 
5 2.038. The absolute 


values 


the heading Q;. The 


E. 


N 
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TABLE 3 

CERTAIN FUNCTIONS OF THE В; VALUES as Usep IN Equation (4a) 
Object E*; о, 

A 2.058 109.100 

B 715 5.188 

c 963 9.183 

D 242 1.746 

E 128 1.343 

Е 616 4.130 

G 1.433 27.100 


intention of these Q; values was to represent the supposed increasing effective- 
ness of the more extreme values. The law to be tested is, then, 


43= ed; +40; + k [4] 


Although the О; values are certainly not linearly related to the B; values, 
still [4], itself, is linear and can be fitted to the data just as before. We get 


A’; = 1.088 4, + -0049 Q; — .005 [4a] 


The predicted scale values given by [4a] appear in Table 2 and, by the 


Mosteller test, are certainly a good fit to the data. 
In view, however, of the small absolute size of d we must wonder if the 


accentuation term is necessary. Perhaps only the linear transformation portion 
is needed and the accentuation term adds nothing of importance. In other 
words, the abridged equation [4b] is all that matters. 

A’; = 1.088 4; — .005 [4b] 


The “theoretical” 4’; values obtained from [4b] are in the last column of 
Table 2. The chi square of the Mosteller test is beyond the .01 level; hence 
[4b], by itself, is not sufficient and we can regard the accentuation term of 


[4a] as significant. 
D. ANALYSIS OF THE GREATER-THAN AND Less-THan Groups 


The experimental group was composed of subjects one-half of whom, on the 
area judgments, were asked to declare which of the two figures was the 
greater; and the other half were asked to judge which of the two figures was 
the smaller. The control group was divided similarly. Since the pooled groups 
show a significant difference, it becomes of interest to see if this procedure 


Produced any differential effect. 
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The four tables of Proportions were scaled by Case 111.5 The scale values 
obtained are given in Table 4. The chi-square entries in Table 4 give the 
results of the Mosteller consistency test, Since none is significant, it appears 
that Case III scaling is adequate. 


TABLE 4 


SCALE VALUES FOR “GREATER THAN” EXPERIMENTAL AND CONTROL GROUPS AND 
FOR “Lesser THAN” EXPERIMENTAL AND Co 


NTROL GROUPS 


Greater Less 

Object Experimental Control Experimental Control 
A 758 234 890 455 
B 859 -965 1.013 781 
О 1.770 2.185 2.504 1.945 
D 1.635 2.124 2.270 1.666 
E 0 0 0 0 
Е .396 «594. -603 476 
G 1.288 1.304 1.709 1.251 
x 15.528 12.048 7.150 16.84 

* At the .01 leve] and nine degrees of freedom, 


» chi square = 21.666. 


anaged as before, with the 
using 20 degrees of freedom. For the less-th 


:566; hence, this is 
and control groups of 
chi square of 37.184 which is annoyingly 
nsidered significant. 


would fit the data. Since the j 


experimental group were entirely greater-than judgment: 
down here, and the Q; values to be used here are the same as before, 


Fitting [4] to the Present data as before we get for the less. 
4'; 1.315 4, + .0027 Q;— .012 


than groups 


[5] 
а of the less-than groups, 


the obtained 
test against the less-than 


y the same as 


given by 
s from these Proportions, Thus it 


appears that 
15] is а good fit to the data. 
ma 


See Footnote 3, p. 154. 
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Is the accentuation in [5] of any value? After all, the value of .0027 seems 
small. Dropping the accentuation term we get 


4% = 1315 А,— 012 [5a] 


The 4^; values predicted from this equation, when applied to the experimental 
group proportions, yield a Mosteller consistency test of 11.182. Now while 
this is higher than [5] yielded, these 4’; values from [Sa] are still consistent 
with the actual proportions; hence we do not need to suppose that the accen- 
tuation term in [5] is significant. 

For the greater-than groups we get 


A’; = .798 4, + .0056 О, — .013 [6] 


The 4^; scale values predicted from [6] fit the greater-than experimental- 
group proportions virtually as well as do the scale values actually obtained 
from these proportions. The Mosteller consistency test applied to these 
predicted 4’; values gives a chi square of 15.656, which is much the same as 
yielded by the scale values actually obtained from these proportions, 

When we drop the accentuation term from [6] we get 


4', = .798 4 — .013 [6а] 


This gives a consistency test against the experimental-groups proportions of 
chi square = 30.416. This is a very considerable change from [6] and, with 
nine degrees of freedom is well beyond the .01 level. It appears that for [6] 
to give a satisfactory fit to the experimental Proportions the accentuation 


term is necessary. 
E. Discussion 


Possibly, what happens is as follows. The subject is called upon to judge 
which of two objects is greater in some aspect, such as size; and the actual 
difference in size is slight. That is, the judgments the subject makes involve 
considerable doubt and difficulty on his part. [How would you design an 
accentuation experiment in which all the judgments required were so clear- 
cut as to cause no difficulty?] As the subject struggles to decide, he also 
has knowledge of or can see that one object is greater than the other in some 
other dimension. As he vacillates in his judgment, he perhaps swings mo- 
mentarily toward one decision and since, to his knowledge, that object is 
greater (although in some other dimension) this makes his judgment fee] 


right and he commits himself. 
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In more general terms, the argument is that when judgments are being 
made in conditions of considerable natural variability a contamina 
able will serve to bias the variability. It is, of course, 
this biasing effect occurs in a way describable by [4] where е and & are the 


ting vari- 


A from Q. The presence of the Q’s in [4] rather than the original Bs is, of 
course, of considerable interest, One should never forget that a high degree 


difficulty. If, for instance, the accen i 
effect (that is, the object actually “looks larger” 


judgments, If, however, accentuation is a matter о 
the subject's vacillations in Tesponse, the differenc 
we note that the relations “greater than” and “ 


f a bias being imposed on 
© seems predictable, First 
less than” are converses of 
; but from a Psychological point 
t “greater than” is likely 


Possibly, then, this happened in the less-than grou 
simply has some extra trouble in making the less. 


uggle to judge “less than" 
ich object is "greater than” 


two tendencies 
that produced the bias we have no speci 

at least, to expect less bias. These guesses seem to be bo 
data. The accentuation effects seem to 
than groups. The 4% were: 


RAYMOND E. HARTLEY 161 


-0027 less than 
-0049 pooled 
:0056 greater than 
The “less than" was not significant, the pooled group was significant, and 
the d of the greater-than groups was significant. Apparently our observed 
accentuation effect was mainly if not entirely produced by the greater-than 
groups. It is of interest that the e’s are: 
1.315 less than 
1.088 pooled 
-798 greater than 
Since the unit of measure in pair-comparisons scaling is a standard deviation, 
these e’s may indicate that the less-than experimentals were finding their 
task more difficult—due to the conflict—than the less-than controls; whereas 
the greater-than experimentals were finding their job easier than the greater- 
than controls. Presumably the cooperation between the two dimensions made 
it easier for them to make up their mind, though more erroneously, 

This discrimination hypothesis clearly implies that both sensory and social 
variables obey the same law. There is no reason to expect any difference 
between the two except that a sensory variable Possesses an immediacy that 
is lacking in something like value, which is more abstract and conceptual ; 
thus while we expect the form of the law to be the same it seems likely that 
sensory variables will be more effective. 

I know of little evidence supporting this last prediction except for an in- 
adequate earlier study of my own. In this earlier study, the subjects were to 
judge the number of dots on a card; and the contaminating variable was the 
size of the card upon which the dots appeared. Due to various difficulties only 
35 subjects were tested, and the whole treated as a pilot experiment, Never- 


theless it turns out that the equation 
A’; = 1.740 4; + .051 Q; — .687 [7] 


gives a rather good fit to these data. The material does not warrant a signifi- 
cance test, but it is interesting that the d here is about 10 times as large 
as our previous d. Incidentally, the large & here is a result of the fact that the 
Square which was regarded as least numerous by the experimental group 
was a different card from the one regarded as least numerous by the controls, 


F. Summary 


It is argued that the experimental paradigm for perceptual-accentuation 
studies should be one of psychophysical scaling in which the objects involved 
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ROLE-DEVIANT RESPONDENT SETS AND RESULTING BIAS, 
THEIR DETECTION AND CONTROL IN THE SURVEY 
INTERVIEW*! 


Family Service of the Cincinnati Area and University of Cincinnati 


Merton S. Krause 


A. INTRODUCTION 


Research interviews are conducted for the collection of data relevant to 
certain variables of interest to the investigator. The data are coded, translated, 
or interpreted into estimated values for these variables and so mediate between 
the phenomenal referents of the variables and their value estimates, Whatever 
else, other than the relevant phenomena, influences the data and the estimates 
derived from the data constitutes a possible source of bias in the estimates. 
Since the critical manifestation of interview-data validity is in the value 
estimates for the variables of interest, data bias is of ultimate practical sig- 
nificance in the effect it has on the final statistics [i.e., as “net error" (32, pp. 
228-229) ]. 

Research-interview data proper are, of course, collected in the form of 
answers to questions. Whatever tends to divert a respondent's answers 
from being maximally responsive, recordable, honest, understandable, con- 
sidered, and complete, is a possible source of bias? We shall refer to this class 
of bias sources as respondent-bias sources even if induced by the interviewer 
or other “causes.” There are also question-bias sources, involving the systematic 
elicitation of irrelevant information in the answers (45, 54) and interviewer- 
bias sources involving surmise, respondent selection, inadequate probing, and 
recording errors (32). We shall concern ourselves in this paper solely with 
the analysis of the detection and control of respondent bias sources. 


—— 
* Received in the Editorial Office, Provincetown, Massachusetts, on April 15, 1964. 
Copyrigh y The Journal Press. А 3 
Pati а н wl nel under Public Health Service Research „Grant MH 
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Ryd lyn Winget. = А Р" 
É FU po Ae Aye in experiments with human subjects which is dis- 


cussed by M. T. Orne (53). Our numerous citations of me M d psychological 
testing literature testify to the existence of the same problem in t Ave chen and 
Lindzey (44, pp. 160-170) reviews it in the projectiv етп area e shall, how- 
ever, restrict our discussion to the voluntary and putatively direct type of interview 
Ог test situation (8); though our citations range more widely. 
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through perception or memory, formulate a m i 
he intends, and verbalize this message as his answer. The limits of his Percep- 
tion, memory, and experience—as well as of his co 

—limit the validity of his answer (e.g., 59 60) 
interviewer-bias or respondent-bias Sources, these limits should manifest them- 
selves in easily detectable Ways: ie., as “Don’t Know,” “Uncertain,” unob- 
tained, incomplete, insufficient, unintelligible, 
answers. With the exception of i 

special answers, these limits do no 
indirectly by attenuating in unkn 
which the estimates are based (jus 
at-home” %), This attenuation or 


They are not merely limits, 
for they may lead responde i dishonest, misrepresentative, 


What sets are Operative i 
respondent’s goals for the 


tencies: i.e. its habit character. It seems wholly consonant with Meehl an 
discussion (47), We have presented els iscussion of role d 
Sources in another context (38). 
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situation as being instrumental. His view of the situation activates or makes 
salient certain trans-situational motives (1, pp. 598-602; 46) that manifest 
themselves as answer-constraining sets, as well as in other ways. But the nature 
of the possible respondent bias depends more immediately on the specific 
sets operant during the interview than on their, possibly multiple, motivational 
Sources. ‘Therefore, bias control requires prior set detection for which re- 
spondent-personality assessment cannot be substituted even if such prior assess- 


ment were feasible (e.g., 24). 
C. SET-DETECTION METHODS 


There are at least six distinct methods of set detection. We shall start 
with the simplest and work our way through the more subtle and complex. 

The respondent’s goals in the interview situation may be, at least in part, 
quite overt. The more these goals manifest themselves outside of merely 
constraining the content of responsive answers (i.e., in byplay), the easier their 
detection. When the respondent is either willing to admit explicitly his 
ulterior goals or unable to control (or recognize) his byplay that expresses 
them, then this byplay readily lends itself to set detection. Thus, self-righteous 
uncooperativeness, fatigue, or deference to the interviewer’s opinions might 
well be manifest in byplay (including questioning the interviewer) because 
they are not likely to be frustrated by admission, and the interview situation 
readily permits their expression in ways independent of the content of the 
answers, Its ready expressibility in byplay favors such manifestation of a 
set like wanting to obtain the interviewer’s approval of oneself though it 
may be frustrated by admission. Neither expressibility nor admissibility 
favors the spontaneous expression in byplay of a set based, for example, 
on a fear of disclosing the fact of a criminal conviction (but the respondent’s 
Tesponsive byplay to a relevant question could serve to betray the set's 
existence). In sum, continual byplay expressive of sets concerning gross and 
Constant aspects of the situation, rather than merely specific questions, should 
&enerally provide behavioral grounds for inferring respondent set. | 

A respondent’s answers may be constrained by sets with regard to their 
understandability, recordability, responsiveness, completeness, thoughtfulness, 
or honesty. The first four of these answer variables, when deficient and not 
corrected by probing, contribute only to indirect (ie, sampling) bias. Possi- 
bly less obvious and therefore less amenable to Correction thoughtless or 
Unconsidered answers, which can occur in forms such as “I don’t know” 
or “I’m not sure” and may be merely incomplete or undetailed answers 
formally satisfying the question, can introduce direct respondent bias. Where- 
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as unresponsive or formally incomplete answers are tantamount to admissions 
of a set in contravention to the proper respondent role, many thoughtless and 
dishonest answers do not betray themselves or the set that they manifest. 
Mere thoughtlessness may be disclosed by probing, but dishonesty often is 
hidden by a respondent for, when it is admitted, it is no more effective than 
bald unresponsiveness in achieving the respondent’s goals and may have 
unfavorable side effects as well (e.g., loss of the interviewer’s trust and 
approval). Therefore, a respondent’s dishonesty, when intentional, probably 
is the most directly biasing, least discernible, and least remediable expression 
of any set. 

The content of a respondent’s answers can be the basis of a number of 
methods for the detection of sets concerning gross and constant aspects of 
the interview situation. All these are based upon the detection of dishonest 
answers and, just as in the case of byplay, dishonesty is contingent upon the 
perceived or apparent opportunity to advance the ends of its determining set. 
Thus, in order to seek the approval of his interviewer through deference to the 
latter’s opinions, a respondent must believe he knows what these opinions are 
relative to the questions asked of him (42). In order to lend himself status in 
his interviewer's eyes, a respondent must believe he knows how to answer the 
interview questions to this end (65). The consistency of his answers as efforts 
toward attaining a constant goal suggest that a respondent has a set that deter- 
mines this consistency [see (19, pp. 80-83) ]. There are at least two empirical 
ways to determine what pattern of answers is to be associated with a given set. 
First, have an independent or a matched sample from the respondent popula- 
tion rate the various possible answers to each question with regard to their 
probable effectiveness as a means to the satisfaction of a given set under the 
projected survey conditions; e.g., rate the social desirability of the various 
answers (19). Second, have the same kind of sample attempt to answer the 
questions in such a way as most likely to attain satisfaction of the given set 
(and, therefore, not give away their intentions when this would jeopardize 
set satisfaction). This could be accomplished in a role-playing (faking) or 
real-life situation (16, 58). However, in subsequently applying such a group 
of studied questions, it may have to be supplemented by other questions 
beyond those essential to the primary research requirements in order that set- 
indicative answer patterns can clearly manifest themselves unconfounded 
with the substantive interview variables [see (11, pp. 152-153), 13, 30, 68, 
p. 233) ]. If these answer patterns are both discernible and interpretable by a 
respondent, they will be inappropriate for the detection of his inadmissible 


sets. 
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Because it is possible that a series of honest, set-independent answers can 
also conform to the answer pattern of some set pattern (cf. 50) or that this 
latter has been imperfectly specified [discussed later under Bias Control 
through "answer balancing"], we must emphasize that answer patterns are 
not conclusive evidence in any individual respondent's case. Confirming by- 
play, of course, contributes to the evidence for the existence of a set and so 
do certain detection devices (in the interview) that reveal as such the re- 
spondent's efforts toward set enactment. 

"These devices are of at least three general types. The first type involves 
questions the possible answers to which have a considerable range of probable 
effectiveness and clear identifiability as efforts toward the satisfaction of 
some given set. The identifiability derives from the unlikelihood of certain 
indicator answers being given, except for that set. The independently de- 
termined falsity of the indicator answer or score for this respondent [eg., а 
“sleeper question” (32, p. 227)], its rarity in the population (25, p. 162), 
or its implausibility [e.g., “I never worry” (47, pp. 537-538) ] may be used 
as evidence of this unlikelihood. If memory errors, sampling error, and errors 
of interpretation are discounted (and if alternative sets can be ignored), this 
first type of detection device could provide sufficient though not necessary 
evidence for set detection. Since respondents may have idiosyncratic concep- 
tions of effective set enactment or hold views of answer rarity and plausi- 
bility comparable to those of the interview coder (27, pp. 17-18), the absence 
of indicator answers from this device cannot be held to be conclusive evidence 
for the absence of a set. e MES N 

The second type of detection device involves combinations of questions 
that can yield a certain internally contradictory pattern of answers only when 
a given set is active (42). Thus essentially the same question could be asked 
worded positively (ie. "You have . . . ?"); later, negatively (i.e, “You 
haven't... ?") to detect an acquiescence or deference set (28; 9, р. 328). 
ОЁ course, а respondent’s memory of what he said before and his skill at 
qualifying answers can help him conceal his set; so the absence of an indicator 
Pattern is, again, inconclusive. This type of device is Prone to alert the 
more intelligent respondents to our detection efforts and so itself elicit sets. 
Although subtlety of question wording can disguise the device, it also tends 
to blur the possible internal contradictions of the respondents answers, 

The third type of detection device involves questions relevant to i ap 
"hore sets that conflict for each possible unqualified pret Thus, Like 
Many people, you've sometime thought. about p я "s latrist haven't 
You?" could pit an acquiescence set against a social-desirability set for many 
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people. The promise in this device lies in the respondent’s self-revelatory 
qualifications in his extended answer which may disclose but need not disclose 
his set [for some indicator-behavior description see (35, pp. 47-49)]. 

The occurrence of spontaneous set-disclosing byplay, the matching of over- 
all answer patterns to those produced as manifestations of known sets, and the 
appearance of indicator answers to built-in detector questions are means 
of set detection that at most require sharp observation and reliable recognition 
by the interviewers. There is another means that requires considerably more: 
namely, controlled variations in reaction to the respondent's answers and 
byplay. This is not a departure from the model interviewer's invariably 
patient, accepting, moderately friendly, always interested, and neither judg- 
mentally reactive nor content-directive manner. The object of this reactive 
method is to give the respondent a response to his (suspected) set enactment 
that is surprisingly incongruent with his expectation of its results, but that 
does not impugn the interviewer's appearance of sincerity nor the rest of his 
role characteristics. An “unwittingly” but plausibly erroneous summarization 
of an answer contradictory to the client’s apparent set (eg, ". . . and you 
make almost $10,000 a year before taxes and...” to a respondent suspected 
of a social-desirability set who reported “. . . over 10 . . 27) is one tactic 
(17, pp. 480-481). If the respondent displays emotional involvement or an 
excessively effortful correction of the interviewer's "error," these may be 
taken as evidence of a set, though not of bias itself; while the absence of such 
effects is wholly inconclusive. (The probing of possibly set-biased answers 
may also evoke this sort of evidence.) However, the possible side effects 
of such emotional arousal make the use of this method advisable only infre- 
quently or at the end of interviews. 

From the preceding discussion, the crucial role of hypotheses about the 
nature of the sets to be encountered should be evident. The identification of 
set-pure answer patterns, detector-question design, and interviewer training 
in reaction tactics clearly presuppose such hypotheses; and alertness to byplay 
patterns and to reactions to probing would benefit from having ideas of what 
to look for. 

D. RESPONDENT SETS AND THEIR CONTROL 


Although persuasive empirical evidence of what motives manifest them- 
selves in a given respondent's interview behavior might be impracticably diffi- 
cult to assemble in a purely inductive manner (see 39), an a priori list of 
probably influential respondent goals can be assembled. Such a list, however 


4 They resemble some of the factors extracted in recent studies of derived response- 
set scores (7), social-desirability ratings (48), and response-set scales (3). 
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generated, would at least enable us to attempt some rational set detection 
and would be amenable to empirical testing and amendment (41, р. 194). 

We believe that most American respondents have, though to different 
degrees, at least the following goals in a research-interview situation: (а) to 
appear polite, (4) to appear honest, (c) to express one’s side, opinions (or 
even mood) and to be understood, (4) to avoid embarrassment or loss of face, 
(e) to avoid getting into trouble through what one says, and (f) to avoid 
excessive effort or emotional upset. The first three of the foregoing are con- 
sonant with ideal respondent role enactment: responsive, understandable, 
recordable, formally complete, considered, and honest answers to all ques- 
tions. However, they may also manifest themselves in (а) apparently socially 
desirable and acquiescent answers, (5) apparently socially desirable and lie- 
concealing consistent answers, or (c) nonresponsively self-assertive answers 
respectively. The last three goals—avoidance goals—are probable sources of 
only social-desirability and personal-desirability (57) biased answers. They 
tend to influence answers with respect to the latter’s (4) probable effects on 
the interviewer or others present, (е) salient others in general who may get 
access to the respondent’s answers and use them against him, and (f) the 
respondent himself respectively. These dispositions to achieve goals which 
may conflict with ideal-respondent role enactment are the sets upon which 
we shall base our discussion of specific set detection and control. 


1. Politeness 


The disinclination of a respondent to discountenance or displease his 
interviewer will depend in the way it affects his behavior upon what actions 
he believes would cause this to happen [see (32, pp. 162-167)]. The more 
“flank exposed" by expressing or implying attitudes, even inadvertently 
(see 31), liable to contradiction by complete and honest answers, the more 
vulnerable the interviewer makes himself to bias from this quarter. Mere 
ambiguity, however, may induce the respondent to react solely on his own 
needs and suspicions (5). Therefore, the best initial tactic would seem 
to be a plausible definition of the interviewer's role and evidence of „his 
durability and neutrality (15) as well as the avoidance of excessive: i.e., 
seductive, rapport [see (32, рр. 46-52) ]. Резо to questions can be used 
toi Seinforce these points foo (eg, “some people I've talked to have said хе 
while others tell me . . ."). When the interviewer suspects he has received а 


re ean Я 
t . i cquiescence [see (20)]. For 
5 ing questions may induce ac 
а ges неше uc da c des to influencing interviewees see Blake and Mou- 
ton (4). 
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politeness-biased answer, his probing should manifest these same points. 
Byplay and the effects of early detector questions may originally alert him to 
the danger. 


2. Honesty 


'To appear honest a respondent must answer in a consistent manner, avoid 
or explain any evident reasons for being dishonest, and reject any implica- 
tions of dishonesty. This means that should the respondent suspect the inter- 
viewer of doubting his honesty, he may attempt to preserve appearances (and 
he will tend to avoid arousing such doubt). The bias problem here is mainly 
one of spurious, lie-concealing consistency in a respondent’s answers. This 
consistency itself, especially if reinforced with byplay cues, may alert the inter- 
viewer; but he must not give any indication of being suspicious or of having 
caught inconsistencies during his initial presentation of the interview material 
(66, p. 10 ff.). Confrontations are less disruptive of the interview as after- 
thoughts while “looking over" the completed forms, whether made most 
directly or in the form of probes of indicator answers to detector questions. 
However, they may be prerequisite to rapport with respondents who are 
"conning" the interviewer (36, pp. 55-56). 


3. Self-Expression 


'The respondent's set to have his say would not seem to be a source of 
direct bias but rather a source of possible sampling bias in that the interviewer 
fails to get a full complement of answers from such respondents. Detection 
of this set is straightforward, but its control may be a trying matter (e.g., 
60). A clear introduction ру the interviewer and an explicit agreement as 
to his role by the respondent is a useful precondition to using reminders of 
the interview's purpose as an initial control tactic. When reminders fail, 
the interviewer has little choice but to gratify the respondent, interject what 
questions he can in a responsive manner, and patiently “wait out" the re- 
spondent in order to complete the interview. 

When the self-expression is that of mood, it should be evident in the 
respondent's byplay; but it also may affect content—e.g., it may make 
him overly critical or pessimistic on attitude questions [see (35, pp. 89-99)]. 
The best control is probably to arrange а reinterview. 


4. Saving Face 


A respondent’s efforts to achieve or maintain a sense of respect and accept 
ance from the interviewer may lend themselves to bias (32, pp. 167-169, 201- 
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203, and 311-312). They presuppose, however, some belief (whether explicit 
or vague and inchoate) about what the interviewer respects or rejects. Since 
this belief may be false or unknown to the interviewer (especially when he 
differs in class or culture from the respondent), the respondent’s pattern of 
answers and byplay may be unrevealing, and reactive methods of detection 
are contingent upon clear suspicions. The respondent’s possible concerns with 
social status (as reflected in the interviewer's clothing, manner and accent, 
language, or auspices), morality, political identity, mental health, patriotism, 
professional or local loyalties can be prepared for and can reasonably be 
expected. The general methods of detection are those we have discussed, 
but there are also certain general methods of control. 

We should, first, attempt to render as many as possible of the substantive 
grounds for face saving, which seem likely sources of bias, irrelevant to the 
interview. Auspices unsuspected of having any strong answer preferences— 
e.g., not a church group for a survey on religious practices [see (32, pp. 185- 
190) ], introductions and question sequences unevocative of ideas of interested 
reference groups for the respondent (52), interviewer's apparent class and sub- 
cultural "identity" neutral relative to the questions—these are optimal; but 
they presuppose considerable knowledge about the respondent population and 
that it is homogeneous. 

Second, and equally difficult, an effectively neutral and convincing sponsor, 
purpose, and interviewer identity must be created to prevent biasing assump- 
tions or projections by the respondents. Observed trial interviews are neces- 
sary to detect and eliminate obvious set-evoking cues from the interviewers. 

Third, the equal acceptability to the interviewer of all answers ought to 
be demonstrated to the respondent. Precoded answers may be presented in 
the question itself as a set of equally reasonable alternatives. When a frce 
answer is sought, alternatives can be demonstrated with probes. Introducing 
these with, e.g., "Several people have told те. . . have you run into this sort 
of thing?” or using the respondent’s language well (15) can serve to display 
the interviewer's neutrality. 

Finally, the interviewer must help the respondent establish an identity 
that he will not need dishonesty or evasion to maintain; thus it may be 
useful to appear comfortable in his home during the interview, if economic 
matters are to be discussed. Again, an opening series of relevant biographical 
questions may be helpful in preventing the respondent from attempting 
to create a false impression about his educational experience (say, by out- 
flanking him before he starts). It can also give the interviewer an opportunity 
to display his acceptance of and respect for the respondent as he really is. 
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5. Foresight 


A prudent respondent will avoid harming himself by what he says (e.g., 32, 
pp. 146-149, 159-162 and 169-170). He must assess the probability that the 
wrong persons will be informed of his remarks and the degree of probable 
harm he would then suffer. This assessment presumably guides his disclosures. 
[Hyman's discussion of respondent anonymity is relevant here (32, pp. 182- 
185; also see 18).] An interviewer usually can hope only to influence the 
respondent’s expectations about what persons will have access to the data 
or research reports. If the auspices, introduction, and interviewer's identity 
and manner do not seem sufficient to assuage a respondent's fears, there are 
at least two supplementary tactics available. The first and better tactic is 
to encourage respondents to reject questions they would prefer not to answer. 
This encouragement should come after some rapport has been established and 
some satisfactory respondent role enactment has occurred (and before any 
"dangerous" questions) in order to maximize its likelihood of being believed, 
Refusals to answer are obvious set indicators, do not yield direct bias, allow 
the interviewer to try alternate questions, may be discussed to explore the 
respondent’s set or influence it, and (if the answer is not obvious from the 
mere fact of refusal, which would tend to vitiate this first tactic) reduce the 
danger of bias emanating from efforts to answer consistently. The second 
tactic calls for standard set detection, especially answer patterns (when the 
interviewer suspects who it is the respondent fears) or detector questions. 


6. Motivation To Enact the Respondent Role 


If the respondent openly displays or admits his unwillingness to be inter- 
viewed, only sampling bias—due to his incomplete or refused answers—is 
likely to result. If his resistance is covert, however, direct bias from dishonestly 
casual or mischievous or from unconsidered answers may result. In the former 
case, the respondent's overt disinterest, preoccupation, intellectual objections, 
or fatigue should probably be explicitly recognized (in as face-saving a 
manner as possible) and openly accepted as reasonable by the interviewer in 
a calm, polite, understanding, and frank manner. The subsequent discussion 
could then serve to motivate the respondent to enact his proper role, arrange 
another time, or terminate the interview. 

The latter case, that of the ambivalent respondent, is quite another matter. 
He wishes to avoid or escape the interview, but he also wishes to go on with 
it or to leave the impression of wanting to go on with it, Thus, he either 
cannot or will not easily be induced to admit any desire to stop the inter- 
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viewing. He gives signs of strain, which may be exacerbated by any attempts 
to focus attention on the strain or to discuss it. These signs will appear as 
signs of tension, anxiety, distractibility, anger, or mood changes, in the 
onset of problems in hearing, comprehension, or expression. When the re- 
spondent’s ambivalence concerns gross and constant aspects of the situation, 
such as the interviewer, the fact of being interviewed, or the general topic 
it will be difficult to distinguish it from possible alternative sources of strain, 
such as chronic states of “intrapsychic stress” (neuroses, for example) or 
current acute life problems. If this background level of strain does not dimin- 
ish, regardless of the situation’s increased familiarity with time, the inter- 
viewer’s accepting and understanding manner, and the respondent's interest 
in the questions, there is little the interviewer can do. The need for ego- 
supportive measures beyond those described should be taken as a signal to 
extricate oneself from the role of research interviewer gracefully and without 
offending or further straining the respondent. The application of such mea- 
sures should also be noted as a potential elicitor of politeness-set effects or, 
if too obvious, face-saving set effects. 

When the respondent’s signs of strain are transitory or variable, it sug- 
gests that his ambivalence concerns specific questions rather than gross and 
constant aspects of the situation, assuming that all set-evocative byplay on 
the part of the interviewer is absent. In this case the interviewer must exercise 
considerable skill relative to these questions in avoiding probing too deeply, 
varying the question order, tolerating diversions and free answers even 
when irrelevant or redundant, reducing the degree of directness or structure 
in the questions asked, avoiding inferential or interpretive restatements, and 
still working toward the goal of the interview as far as seem advisable. Such 
liberties on the interviewer's part may increase the danger of question and 
interviewer-bias sources. Also they presuppose exceedingly alert and sensitive 
interviewers, who have been trained to watch the effects of their questions 


as well as listen to the answers they elicit. 


E. Bias DETECTION 


There is only one means of detecting bias in any given data or final statis- 
tics, and that is to compare them with less-biased data or statistics. When data 
on the same variables have been independently collected in two or more ways 
from the same respondents, one of which is believed to yield less-biased 
results than the others, bias detection is possible (32, pp. 226-227; 66, 
p. 48-49) ; otherwise it is not possible. Criterion or control data are selected 
as such because in all regards that are believed to be susceptible to bias they 
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either seem inherently more valid on the basis of rational argument or they 
are believed to resemble more closely another class of data that provides a more 
valid criterion were the other data available. Thus, bias detection requires 
that we have available comparable data or statistics of greater validity than 
the data or statistics at issue. When one questions respondents about matters 
of recorded fact, such as age, car ownership, or public assistance, it is probably 
easily demonstrable by rational argument that the official records are more 
valid than the respondent’s answers are likely to be [see (65, p. 381)]. 
However, the subjects of many surveys are not matters of recorded fact and 
so are not amenable to such straightforward bias-detection comparisons, 

In the area of attitude, opinion, or knowledge surveys, the criterion data 
are those which have been collected under different respondent-set conditions 
than the data at issue. (We are, of course, still slighting both interviewer-bias 
and question-bias sources.) If one is persuaded of a lower probability or 
degree of bias under the criterion conditions, he can prepare a correction 
factor to be applied to statistics derived from data gathered under the experi- 
mental (usually the more convenient or common) conditions. If one is not 
so persuaded, he is bound at least to include warnings in his reports when the 
conditions compared yield significantly different statistical values. His per- 
suasion must derive from arguments concerning, inter alia, matters of set detec- 
tion and control. The detection of probably biasing sets in the “experimental” 
conditions not detected in the criterion conditions, the absence of any other 
biasing sets in the criterion conditions, and a significant difference in final 
statistics between the two conditions (which can be traced to differences 
in answers that should reasonably have been affected by the set-control tech- 
niques used in the collection of the criterion data)—this is the evidence upon 
which arguments of comparative validity must rest. 


F. Bias CowTROL 


The biasing effects of respondent sets can be controlled in several ways. 
Perhaps the most obvious way is to attempt to increase the respondent's 
motivation to answer the interviewer’s questions responsively, recordably, 
understandably, completely, thoughtfully, and honestly, so that competing 
secs are frustrated. To this end we strive to find some way to elicit motivation 
in the respondent which we expect would induce him to enact his proper 
role: adopt auspices with which he identifies, project uses for valid data 
which would benefit or please him, bring appropriate membership of refer- 
ence groups to his mind, reinforce his role enactment (22), inspire empathy 
with the interviewer's task of getting unbiased answers, etc. [see (34, pp. 65- 
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91)]. This means is the foundation of bias control. It is direct and preventive, 
necessary though not always sufficient. There are, however, at least six other, 
supplementary means of bias control possible (cf. 32, p. 276). 

First, se? control, which has been discussed, is intended to decrease the 
degree to which role-deviant sets constrain the respondent’s answers and 
thereby prevent the occurrence of biased answers (32, pp. 308-312). It works 
by inhibiting, harmlessly satisfying, or avoiding eliciting motives in the re- 
spondent competitive with his proper role enactment (34, pp. 193-201). 

Second, set counterbalancing, which acts to cancel the expected biasing 
effects of one set and the means of satisfying it by eliciting another set or means 
of set satisfaction of equal and opposite expected bias and so prevent bias 
[see (63) for a special case]. Thus, an apparent set to be polite, which may 
be manifest by exclusively favorable answers regarding the interviewer's 
auspices, might be balanced by a slightly critical comment from the inter- 
viewer about his sponsoring organization. Such a remark could serve to check 
the tendency toward positive bias in answers concerning this organization. 
Again, a respondent's tendency to restrain his criticism might be countered 
by probes challenging his perspicacity: “Several people had some strong things 
to say about the wait for an appointment, what was your impression of it?” 
If these are made in a calm, unprovocative manner but with the evident 
expectation that the respondent ought to have some impressions, they may 
elicit a politeness or face-saving set contrary in effect to his prudent caution. 
Counterbalancing clearly requires that the set ostensibly biasing the respond- 
ent’s answers is known because the design of appropriate questions, probes, 
remarks, and introductions is contingent upon such knowledge. Pitting two 
oppositely biasing sets against each other is just what we proposed for the 
third type of set-detector question, but now we are interested in the 
respondent's final answer rather than in his process of answering. Naturally, 
the likelihood of perfect counter balancing is rather remote; so the best 
result we should strive for is to have the respondent compromise on a less- 
biased answer than if his natural set were unopposed. Care must be taken not 
to overcorrect his probable bias. No matter how weak is the conflict we 
induce in the respondent, however, it may act (a) to lengthen the interview 
through increasing decision time (e.g, 2); (0) to increase the respondent's 
strain (43, pp. 260-269), (c) to decrease the test-retest reliability of the 
answer (35, pp. 15-67), and (d) to cause fluctuation in direction of im- 
balance from question to question. 

Third, answer balancing aims to equate the degree of set satisfaction for 
a given set in the several possible alternative answers to a question (19, pp. 
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59-71) or in the answers to several questions directed at the same informa- 
tion (eg, 23). If successful this would render that set incapable of 
biasing respondent’s answers to the question and so prevent bias rather than 
merely correct it. To achieve this, however, the alternative answers must be 
completely indistinguishable as to their degree of the given set’s satisfaction. 
Therefore, some means of selecting a group of balanced alternatives must 
be used. Edwards (19, pp. 1-31) describes one means of approximating such 
a result for class-determined or subculture-determined sets by using an inde- 
pendent random sample of the respondent population to scale the alternatives 
with regard to their degree of satisfaction of probably relevant, common, and 
influential sets.° If this scaling can be done and is relatively free of bias, if 
the population of interest is homogeneous in natural set, if the reliability 
over subjects of the scale-value assignments is high,” if the bias contributions 
of the various relevant sets are additive or a full factorial design was used 
in the scaling study [see (33, pp. 247-249) ], and if the contributions of other 
sources of respondent bias than the specific sets covered in the scaling study 
are negligible, then and only then could a reasonably near answer balance 
be assured. However, anything short of perfect balance would probably con- 
tinue to yield considerable bias (eg. 10; 21) and only result in making 
the set-dominated respondent's forced choice more difficult (19, pp. 59-60) 
which may tend to induce conflict in these respondents (19, p. 60; 40). 

The degree of balance achieved in a group of alternative answers with 
regard to their satisfaction of class-determined or subculturally determined 
sets can be tested for certain respondent populations by comparing the answers 
of a random sample of respondents motivated to enact a given set (i.e, to 
"fake") with those of an independent random sample not so motivated (19, 
pp. 55-58; 58). If the popularity of the various alternative answers to a 
question differs significantly between the two samples, these answers are not 
balanced relative to the given set for this respondent population (e.g., 6). 
Of course, population heterogeneity with regard to natural set, high item 
variances from other sources, or considerable popularity of the indicator 
answers under the control condition (68, pp. 230-232) would necessitate 
large subject samples to minimize the likelihood of erroneously finding a 
condition of balance to exist. 

A logical extreme of answer balancing is the use of nonloaded or evenly 


9 The social desirability variable studied by Edwards may actually consist of sey- 
eral variables (e.g., 6, 19, 48, 57) and be overshadowed by idiosyncratic Personal 
desirability (e.g., 29). : | р 

7 This may well Бе lowered by variance due to differences in interview settings 
[see (37) and (55)]. 
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loaded questions (54, рр. 177-202): i.e., of set-irrelevant, subtle, or neutral 
questions (67). These are tantamount to answer balancing at relatively 
low set-satisfaction (or dissatisfaction) levels, though their development 
primarily involves problems of question rather than answer selection. Em- 
pirical procedures like those described are properly included in the selection 
or testing of such questions. 

Fourth, intrainterview bias-elicitor balancing involves using two or more 
questions in different forms or positions, with different introductions or asked 
in different ways which are designed to elicit the same information in a 
manner such that some combination of the answers is relatively unbiased (14). 
It is appropriate to the correction of bias in single interviews. There are two 
distinct models of bias-elicitor balancing: the probabilistic and the determinate. 
'The former assumes, for example, that the biasing effects of the universe of 
sets are symmetrically distributed about zero, so that a random sample (with 
respect to set elicitation) of questions-in-context about the same information 
would tend to have a mean bias of zero (but for sampling error). The major 
obstacle in principle to realizing this model, if it is valid, is the drawing of 
such a sample, since the necessary universe of questions-in-context is merely a 
theoretical construct. However, the practical effects of asking several such 
questions in an interview (i.e., as itself a source of set elicitation) may be as 
serious an obstacle. 

The determinate model, though also subject to the latter practical problem, 
has a different theoretical problem to face. It assumes that any set against 
which it is used can be reversed in direction of bias, though unchanged in 
degree, by some alteration of question-in-context. The simplest instance of 
this model is probably that designed to control intrainterview bias due to an 
ss set (or its converse, an oppositional set). Thus, 
one might ask a respondent to express his agreement or disagreement to a 
pair of statements (separated in the interview by considerable other "neutral" 
material), such as (a) "It's hardly fair to bring children into the world the 
way things look for the future,” and (2) “Children born today have a 
wonderful future to look forward to” (from 42). There are four possible 
answer patterns: (a) agree-disagree, which is pessimistic, (5) disagree-agree, 
which 4s optimistic, (е) sgrétdgren, whitch ds amiest айа (2) digte 
disagree, which is oppositional. If single optimistic answers receive a unit 
score and single pessimistic answers receive a zero score, the sum of the residual 
(unbiased ) answers® would be zero, two, one, and one respectively. Thus, 


acquiescence or politene: 


phrased as (5) and if a respondent's 
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for the possible direct bias of either question alone, we can through set- 
elicitor balancing substitute the indirect or sampling bias of an unusable total 
response (as a score of one may be interpreted as indicating here). This means 
of bias control is, of course, homologous to the second type of set-detection 
device. It must be noted that with more questions than two, considerably 
more refined and complex, though not obviously preferable, scoring procedures 
are possible [see (30) ].9 

Fifth, interinterview bias-elicitor balancing can serve to minimize net bias 


possible when some situational, procedural, or interviewer variables are re- 
liably divisible into subgroups or subranges of which two are suspected of 
eliciting biasing sets opposite in direction of bias. When both supposedly 
biasing subgroups are used, they may be balanced against each other. 

If we assume that a question of interest has only two possible answers, 
А; and As, that the proportion of respondents giving A, in the absence of 
any relevant set elicitors is x and the proportion giving As under these condi- 


its opposite number favoring А» is аз, and that the random sample of 
respondents upon which % operates is 1 > фу > 0 of the total sample of 


respondents while аз operates on Фа = (1— A), then the net bias will be 
zero if and only if 


л hi 
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total disagreement with (a)'s by Xag and total disagreement with (2)'s by Xpa then 
[Xj + X ba/X aa + Xaa + Xba + Xpal will yield this sum score. If this score is signifi- 


Agree Disagree 
a phrasing жа au 
, 
b phrasing Xba Xba 


evidences pure set determination of the respondent’s answers, Sample estimates of the 
relative contributions of set and honesty to the obtained results are also Possible 
9, 49). к | 
( ò p equal numbers of questions phrased on each side of the balance is not 
a necessary condition of using these procedures, it does (a) safeguard equally against 
either direction of bias and (b) provide the most conservative null hypothesis because 
the population proportion of highest sampling error is one-half. 
19 Since the respondent bias in the estimate of x from the sample is } 
(1— л) — as (1 — 2) x and that in (1— x) is by = — b; ifb 
readily derived. 


1 = a, 
1 = 0, the equation is 
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Even rough estimates of a2/o, and 1/(1—z) can serve to reduce net bias 
(through the adjustment of 5; and ё») if they are in the correct direction from 
unity. In the absence of all reasonable grounds for estimates, 5; == .5 has 
been recommended as the best bet because the maximum Possible net bias, 
given some particular numerical difference between an estimate of x based on 
the respondents subject to a, and an estimate based on those exposed to о», 
is less for this value of p than for any other (32, pp. 315-316; 51, pp. 186- 
188). 
This procedure would be appropriate, for example, where a relatively wide 
range of discriminable social-class members were being used to interview 
middle-middle-class respondents, and it was suspected that these respondents 
would be biased in reporting their expenditures due to face saving tendencies 
with upper-middle-class interviewers and due to politeness with lower-middle- 
class interviewers. If the average expenditures for the sample respondents is 
the most important statistic to be obtained, this is the procedure of choice. 
If, on the other hand, certain cross-classifications are of primary interest, 
it is gross bias regardless of direction rather than net bias that should be 
minimized (32, рр. 317-318) : i.e, Procedures 1 through 4 are then prefera- 
ble [but see (61)]. А А 

Sixth, statistical adjustment presupposes bias detection and measurement, 
as do Procedures 2 through 5; but uses this information to correct rather 
than prevent bias. Given an observed respondent answer or score or sample 
statistic, we must know how much to adjust it in what direction. This 
depends, for our purposes in this paper, upon the following several factors: 
(a) respondent-set elicitation or activation, (b) bias opportunity, (с) bias 
propensity [see (56)] given (a) and (0), and (4) actual bias Cien the 
ween observed and unbiased answer, score, or statistic). 


signed difference bet pe , 
F to make statistical adjustments, but 


The last is all we really must know | i 
usually it will be more feasible in practice to use combined measurements or 
a 


estimates of the other factors. If we may suppose that these factors each 


assume values of one or zero for any specific respondent and are statistically 


independent, a simple example can be given. | 
If (а) 40 per cent of a respondent sample were, say, induced to try 


saving face on questions concerning income, (0) all respondents were asked 
whether or not their income was over a certain amount and for 75 per cent 
it was not, and (c) half of those who were motivated to save face and could 
positively bias their answers [i.e. 5X (4X 75) = 15 A cent of the 
sample] did so, then knowledge of (а), (0), and (c) for eac! respondent 
would permit perfect bias correction for this source, for it is tantamount 
to recognition of every answer biased from this source and of the unbiased 
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answer for each. Any lesser degree of information! would make the correc- 
tion of gross bias imperfect, while only precise sample values of (a), (5), and 
(c) would suffice to wholly correct net bias.12 "Thus, even in the simplest 
case the information required for gross (respondent-by-respondent) statis- 
tical adjustment for bias is unlikely ever to be available, and even that re- 
quired for net (sample-estimate) adjustments can only be roughly estimated : 
most roughly for (c); least for (a)—if set detection is vigorously pursued. 


С. CONCLUSIONS 


Bias control is our ultimate objective, but in order to know how closely 
we have achieved it (and, for Procedures 5 and 6, even to attempt it) bias 
detection is required. Insofar as role-deviant respondent sets are the source 
of bias, their control can reduce bias and, on an experimental basis, pro- 
vide the conditions for bias-detection studies. Set control, however, can 
only be evaluated and, on the basis of prior studies of the same respondent 
population, efficiently designed to the extent that set detection is effectively 
utilized. The remaining respondent-bias control procedures, the second through 
the sixth, also depend upon prior set-detection studies and therefore on 
effectively utilized set detection. Thus, the validity of survey-interview data 
and statistical estimates can be insured against respondent bias only by prior 
studies for set detection and then for comparisons of various relevant set 
and other respondent-bias control procedures. In this way only can an 
investigator systematically minimize the Possible respondent bias in his atti- 
tude-survey, opinion-survey, or knowledge-survey findings.1® Unfortunately, 
he cannot be certain of having effectively avoided all respondent bias be- 
cause some unanticipated sets may be operative and because the set-detection 
methods are prone to false-negative results. This limitation is not grounds 
for ignoring the preparatory research on respondent bias suggested, for such 
research is а prerequisite of interview-situation design, questionnaire design, 
interviewer selection and training plans, the survey's "advertising campaign 
plans," and the construction of statistical adjustments. 


11 It is obvious that observed answers, Scores, or statistics cannot be properly ad- 
justed on the basis of (a) or even a, information alone [as Webster (64) has pro- 
Pa EO ac = a, and b = л, as a, and z were used in discussing Proce- 
dure 5. Therefore, given а, and x and ? = 1, by = a, (1 — x), and so the estimate 
x is corrected by subtracting b; from it. In our example above, x = 7 _ AS = 25 
per cent [see (32, pp. 316-317) ]. 


18 Except where he is repeatedly predicting a criterion (eg, an election) and can 
develop appropriate item weights by multiple-regression techniques. 
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To develop a maximally valid survey of opinions or attitudes and so mini. 
mize the possible errors attributable to poor sampling procedures ele 
ambiguity, or uncontrolled respondent bias, one requires a савай project 
that may be more demanding than the final survey. di 
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A COMPARISON OF THE DIRECTION-OF-PERCEPTION 
TECHNIQUE WITH THE LIKERT METHOD IN 
THE MEASUREMENT OF ATTITUDES* 


Department of Physical Education, University of 
California, Santa Barbara 


BanBaRA L. DRINKWATER 


A. INTRODUCTION 


In a recent study relating attitudes of college teachers to teaching effec- 
tiveness two techniques of attitude measurement were employed. One, the 
Likert method of summated ratings, has been widely used; and considerable 
data on its validity and reliability are available in the literature. The other, 
the direction-of-perception technique, is relatively new; and information re- 
garding its usefulness as a tool of attitude measurement is not as readily 
available. The use of both techniques in a single study should provide further 
information regarding the relative merits of the two techniques. 


1. The Likert Technique 


In the Likert method of attitude measurement the respondent is asked to 
check one of five possible answers to each of a series of statements. In this 
study the choices, preceded by their respective weights, were as follows: 1) 
I strongly disagree, 2) I disagree, 3) I am undecided, 4) I agree, and 5) 
I strongly agree. An individual's attitude score is the sum of the point values 


for the choices selected. 
2. The Direction-of-Perception Technique 


The direction-of-perception technique (1) is an adaptation of Hammond's 
error-choice method (2) in which a series of multiple-choice questions are 
prepared, with foils equidistant from the truth ог at two opposite extremes 
if the true answer is indeterminate. An attitude is considered an affective 
state that may produce error in perception and recall, and an individual's 
attitude is determined by measuring the constancy of error in his choice of 
answers (2). А & » 1 : « 

Bernberg objected to the emphasis on "error" in perception because, To 
assume error or distortion in perception places a value upon the attitude which 
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is considered good or bad depending on the views of the test constructor or 
persons external to the situation” (1, р. 399). Bernberg, instead of basing 
his questions on fact, selected stimulus materials with no factual basis, 
theorizing that a respondent would choose answers in line with the direction 
of his perception as determined by his need-value system. An example of the 
type of question is found in Bernberg’s survey of employee morale in a large 
aircraft-manufacturing plant: 
When workers have worked side-by-side for a considerable length of 


time each individual worker becomes friendly with about: (a) 65% of 
the group; (b) 80% of the group (1, p. 402). 


Bernberg obtained evidence for the validity of the direction-of-perception 
technique by comparing management ratings and union ratings of morale with 
the test results. The test scores corroborated the judgment of management 
and the union ratings and had a split-half reliability of .86. 


3. Selection of Foils 


Little information has been published on the selection of foils either for 
the error-choice method or the direction-of-perception method, particularly 
when no factual answer is available. Hammond's and Wechsler's choices of 
responses (2, 6) were numerically equidistant from the true answer when 
such answers existed or equidistant from two alternate extremes when no true 
answer existed. Bernberg (1) provided two types of answers: (a) numerical 
percentages and (b) verbal statements. After the administration of Bernberg's 
test, some respondents said that ". . . there was too small a difference in the 
percentages used in some of the questions" (1, p. 404). This comment sug- 
gested the need for inquiry into the effect on responses of the numerical 
separation of the foils. 


B. PROCEDURE 


'The comparison of the Likert method of summated ratings and the direc- 
tion-of-perception technique was part of a larger study on the relationship of 
teacher attitudes and the effectiveness of college teaching. Each of the two 
attitude instruments contained components designed to measure (a) attitude 
toward college students, (5) attitude toward the technical aspects of teach- 
ing, and (c) attitude toward student ratings of teacher effectiveness, 


1. Subjects 


The subjects were 410 teachers randomly selected to represent universities, 
liberal-arts colleges, and teachers colleges of the midwestern states, 


) 


mL 
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2. Selection of Attitude Items 


Current literature pertaining directly to college students was used as the 
basis for preparing an Attitude Inventory of 36 statements to measure teacher 
attitudes toward college students. In general the statements sample opinion 
on student behavior, ideals, attitudes, academic interest, and the nonacademic 
responsibilities of the teacher to his students. Effort was made to phrase the 
statements in such a way that the direction of the favorable response was 
obscured. Corresponding questions were prepared for the Questionnaire for 
College Teachers, the direction-of-perception instrument. The order of the 
items on both instruments was the same and was determined by a random 
selection of the index cards on which the statements had been typed. 

Twenty-four items to measure attitude toward the technical aspects of 
teaching were selected from studies listing the instructional competencies 
relevant to classroom teaching procedures: (а) preparation for class, (5) 
testing procedures, (c) conduct of the class, (4) the use of teaching aids, (е) 


rapport with students, and (f) educational methods. 

Because the cooperation of the respondents depended in part on their 
willingness to be evaluated by students, a section was included dealing with 
the respondent’s attitude toward student ratings of teacher effectiveness. The 
statements used were taken from studies reporting results of research on many 
of the objections to student ratings and from the Manual of Instruction for 
the Purdue Rating Scale for Instruction (4). The results from this portion 
of the attitude instruments were used to check the bias to be expected if only 
teachers possessing favorable attitudes toward student ratings participated. 


3. Selection of Direction-of-Perception Foils 


Hammond’s suggestion of using more than two answers for each question 
(2) was incorporated to provide some indication of attitude intensity. Four 
methods were used to determine the numerical separation of the answers, On 
30 questions four answers were separated by a single value ranging from 
seven to 21 percentage points. For example in the following question, each 


answer is 21 percentage points distant from adjacent answers: 
t of college students indicate that they would take 


cen! t 
йүз pet m the room to cheat on an examina- 


2 fro! 
advantage of a teacher’s absence 
tion? (a) 21%, (b) 42%, (с) 63%, (4) 34%. 

Fifteen other questions had foils separated by a single value but were 
arranged so that corresponding favorable and unfavorable choices were equi- 


distant from a value of 50 per cent. 
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What per cent of college teachers would prefer to have their effec- 
tiveness judged by their department heads rather than by their students? 
(a) 82%, (b) 66%, (c) 34%, (d) 18%. 
On 12 questions answers were not separated by any common value, but 
were distributed so that those answers equally favorable or unfavorable were 
equidistant from a value of 50 per cent. 


What proportion of college students condone cheating as an acceptable 
practice? (a) 33%, (b) 45%, (c) 55%, (d) 67%. 


On 14 questions the favorable choices were separated by four points, the 
distance between a favorable and unfavorable choice was six points, and un- 
favorable choices were four points apart. 

What proportion of college teachers feel that the ability to speak “off- 
the-cuff” is more important than detailed preparation for class? (a) 25%, 


(0) 29%, (c) 35%, (d) 39%. 


Ten questions had miscellaneous types of answers in which, for example, the 
choices were expressed in terms of “greater,” “higher,” “more,” or “always.” 

The variation in the number of questions in each category resulted from 
the necessity of providing the questionnaire with face validity. For example, 
in answer to some of the questions answers as large as 50 per cent would have 
appeared unrealistic. To determine whether the methods of selecting foils 
affected the responses, the answers within each category were compared with 
replies to the corresponding Likert items. 


4. Comparability of the Attitude Instruments 


To determine if the corresponding items on the two instruments were 
comparable in content, a combined form of the two inventories (with ques- 
tions from the direction-of-perception instrument in Column A and cor- 
responding Likert statements in Column B) was presented to a class of 30 
graduate students. The students were asked to read both the question and 
the statement carefully and to indicate on a five-point scale how well they 
felt the two agreed in content. The scale choices were as follows: (a) no 
relationship between attitude statement and question, (5) very little rela. 
tionship between attitude statement and question, (c) a fair degree of rela- 
tionship between the attitude statement and question, (d) attitude statement 
and question cover approximately the same topic, and (e) attitude statement 
and question cover precisely the same topic. Sixty-two of the paired items had 
an average score of 4.00 or better, while the remaining 19 scored between 
3.50 and 3.99. Because all the matched items were judged to haye better than 
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a fair degree of relationship, it was decided to retain all 81 items on each 
attitude instrument. 
C. RESULTS 


The reliability coefficients of the attitude instruments shown in Table 1 
are coefficients of equivalence as estimated by the split-half method and cor- 
rected by the Spearman-Brown formula. With the exception of Area 2, 


TABLE 1 
RELIABILITY COEFFICIENTS FOR THE ATTITUDE INVENTORY 
AND THE QUESTIONNAIRE FOR COLLEGE TEACHERS 


Attitude Inventory Questionnaire 
Scores Split-halves r Corrected r Split-halves r Corrected r 
Area 1 .70 82 mà $7 
Area 2 .38 55 47 64 
Area 3 81 -90 72 84 
TOTAL 175 :85 83 91 


attitude toward the technical aspects of teaching, the reliability coefficients of 
the subscales in both instruments indicate that they may be used independently 
of the total scores as measures of the attitudes of college teachers toward their 
students and toward student ratings of teacher performance. 

А comparison was made of the reliability coefficients for the Questionnaire 
for College Teachers (ОСТ) which use the direction-of-perception technique, 
and the Attitude Inventory ( AI), an example of the Likert method of attitude 
measurement. The results are shown in Table 2. Based on total scores, the 
reliability of the Questionnaire is significantly higher than that of the In- 
ventory at the .05 level. There is no significant difference in the reliabilities 
for Area 1. The reliability for Area 2, though comparatively low, is higher 
(at the .05 level) on the Questionnaire. For Area 3, the reverse is true; the 
reliability of the Inventory subscale is significantly higher (at the .05 level) 


than that of the Questionnaire. 
TABLE 2 
COMPARISON OF RELIABILITY COEFFICIENTS FOR QUESTIONNAIRE 
E COLLEGE TEACHERS AND THE ATTITUDE INVENTORY 
Оѕімс FisHer’s z FUNCTION 


Difference 

Questionnaire Inventory (S.E.p = .11) 
Area 1 1.33 1.16 7 
Area 2 76 IM 26* 
Area 3 1.22 pi cR 
TOTAL 1.53 1.26 27 


* Significant at .05 level. 
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The results of the correlation of the total and area scores on the Question- 
naire for College Teachers with those of the Attitude Inventory are shown in 
Table 3. An r of .75 between total scores makes it possible to reject (at the 

TABLE 3 ` 
CORRELATION OF TOTAL AND AREA SCORES OF THE QUESTIONNAIRE 


FOR COLLEGE TEACHERS WITH CORRESPONDING 
SCORES ON THE ATTITUDE INVENTORY 


Area 1 Area 2 Area 3 TOTAL 
r .78** 31° :67%% .75%% 


* Significant at .05 level. 

** Significant at .01 level. 
:01 level) the hypothesis that the correlation between the Likert-type attitude 
scale (AI) and the direction-of-perception attitude inventory (QCT) is 
zero. Significant correlations between the scores of subscales also make it 
necessary to reject the hypothesis that the correlations between subareas on 
the Likert-type attitude scale (AI) and the corresponding subareas on the 
direction-of-perception scale are zero. For Areas 1 and 3 this hypothesis is 
rejected at the .01 level; for Area 2, at the 5.0 level. From these results it 
appears that there is a definite relationship between the Likert method of 
attitude measurement and the direction-of-perception technique. 


Five methods were used in preparing the responses to the items on the 
Questionnaire’ for College Teachers. From the results reported in Table 4 it 
is evident that each of the five methods produces results comparable to those 
obtained for corresponding items on the Likert-type Attitude Inventory. An 

TABLE 4 
CORRELATION oF ITEMS ON ОСТ GROUPED ACCORDING TO METHOD USED 


IN FORMATION OF THEIR RESPONSES WITH CORRESPONDING 
ITEMS IN THE ATTITUDE INVENTORY 


Methods 
a b с d e 


r .67** 598% .58** 340% gee 


** Significant at .01 level. 


analysis of the differences between these r’s, as shown in Table 5, results in 
only one significant difference—that between Method a (foils separated by 
a single value) and Method d (foils separated by 4-6-4 percentage points). 
Before Method d is discarded as a means of formulating direction-of-percep- 
tion foils, it should be considered that most items of this type are in Area 2 
and that the lower reliability of this area may have had an adverse effect on 
the comparison of these items with corresponding Likert items. No difference 
between the remaining four methods was significant (at the .05 level). 


Ра 
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TABLE 5 
DIFFERENCES OF "METHODS" r EXPRESSED IN FISHER's g 
Methods 
a b с 4 е S.E.p=.17 
а 4% 15 46%* 13 
b 02 33 .00 
с 31 —.02 
d —.33 


** Significant at .01 level. 


One of the difficulties mentioned in relation to the direction-of-perception 
method was the difficulty in formulating items (6). In many cases data are 
not available to serve as a basis for the preparation of responses that are 
equidistant from the truth. Since the items in this study, none of which were 


based on actual data, were significantly related to the corresponding Likert- 


type statements, it is evident that lack of data need not prohibit the use of 


the direction-of-perception technique. 

One other point to be considered is that both Hammond (2) and Wechsler 
(6) combined factual and nonfactual items on their attitude instruments. 
This procedure resulted in comparatively long tests, but in a small number 
of attitude statements. Hammond (2), for example, suggested a ratio of two 
factual questions to each nonfactual or attitude item. Аз a result of the small 
number of attitude items, the reliability coefficients for these instruments were 
relatively low. No factual items were included in the Questionnaire for 
College "Teachers, and all the reliability coefficients (with the exception of 
Area 2) were higher than those reported by Hammond (2) or Wechsler (6). 
It appears that a combination of factual and nonfactual items is not necessary 
in the construction of an attitude instrument using the direction-of-perception 


method. 
D. SUMMARY 


one using the Likert method and the other the 


direction-of-perception technique, were developed to measure the attitudes 
of college teachers toward college students, the technical aspects of teaching, 


and student ratings of teacher effectiveness. The reliabilities of these instru- 
onses obtained from college and university 


ined from resp 
ub cate jo cda states. Eighty-five teachers returned the Question- 
naire for College Teachers (an example of the direction-of-perception tech- 
nique) and 74 returned the Likert-type Attitude Inventory. The reliability 
coefficients for Areas 1 and 3 and for the complete instruments were satis- 
factory. A lower reliability for Area 2 suggests that caution be used in basing 
any conclusions upon data derived fro 


Two attitude inventories, 


m it. 
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The data obtained from the administration of these three instruments were 
inalyzed as follows: (a) the relationship of the Attitude Inventory and the 
Questionnaire for College Teachers was studied in terms of the total scores 
and the scores for each attitude area and the reliability coefficients; (5) 
scores grouped according to the methods used in preparing the answers to the 
direction-of-perception questions were compared with scores of corresponding 
items on the Likert-type scale. 

For the population defined in this study the data support the following 
conclusions: 

1. The Questionnaire for College Teachers and the Attitude Inventory are 
reliable measures of attitudes toward college students and toward student 
ratings of teacher performance. The area on both instruments designed to 
measure attitudes toward the technical aspects of teaching should be further 
refined before data derived from its administration can be used with assurance. 

2. Attitude scores as measured by the direction-of-perception technique are 
significantly correlated with attitude scores measured by the Likert method. 
These results, plus the adequate reliabilities on two of the three subscales 
suggest that the direction-of-perception technique is a reliable method of 
measuring attitudes. 

3. All of the five methods used in formulating responses to the questions on 
the Questionnaire for College Teachers produced responses comparable to 
those obtained on the Attitude Inventory. It is not necessary to have factual 
data as a basis for preparing foils or to combine factual and nonfactual items 
in order to have a reliable instrument. 
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DISCRIMINATION OF GEOMETRICAL PATTERNS IN 
TRIBAL, RURAL, AND URBAN CHILDREN* 


Utkal University, Bhubaneswar, India 


P. C. Nanna, J. P. Das,! Амр H. К. MISHRA 


A. INTRODUCTION 


Does past experience with a certain pattern alone always lead to its correct 
discrimination from another pattern closely resembling the original? Is this 
true even if distinct verbal labels are not available for the two patterns? In 
the present study, three samples of children from three separate cultural 
environments were tested for their ability to discriminate between the follow- 
ing pairs of figures: circle vs. ellipse, rectangle vs. parallelogram, obtuse- 
angled vs. acute-angled triangle, and a circle within a triangle es. а triangle 
within a circle. The children ranged in age between 6 and 10 and lacked 
appropriate verbal labels for the test figures. 

The study was conducted in three parts, starting with the children from 
an aboriginal tribe in the southern part of Orissa, India. In the summer of 
1962, the three authors went together to the interior of a tribal area. As their 
entry into a tribal village attracted adults and children alike, one of the 
authors kept the villagers amused with view-master pictures while the other 
two authors tested subjects in groups of five. In the remote villages visited, 
any group of strangers could be perceived by the children as official vaccinators, 
therefore, the children's fears had to be allayed by a generous distribution of 
sweets. Rapport with the parents of the tribal children was easily established 
by the authors introducing themselves as teachers from a "big school." Rapport 
was essential, so that shy children would be encouraged by their parents to 
participate in the experiment. No particular problem was encountered in 


obtaining the rural and urban samples. 
B. METHOD 
1. Experimental Design 


Each of the three samples was divided into eight experimental groups, 
following the requirements of a two-by-two-by-two factorial design in a man- 


* Received in the Editorial Office, Provincetown, Massachusetts, on March 4, 1964, 
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ner to be indicated. The four pairs of geometrical figures were classified 
under two broad categories as follows: (a) parallelogram vs. rectangle and 
ellipse vs. circle and (5) obtuse-angled vs. acute-angled triangles and circle 
within triangle vs. triangle within circle. The figures in Category (a) were 
presented to a group of Ss half of whom had the parallelogram and ellipse 
as cues associated with reinforcement, whereas the other half had rectangle 
and circle. A similar design was followed for the stimuli in the second 
category. Further, the authors made an attempt to determine whether or not 
filled figures (inside area blackened by India ink) were more easily dis- 
criminated than unfilled figures (contours drawn with India ink on a white 
background). Thus, each of the categories was presented in two versions, so 
that eight experimental groups were required. Five children in each sample 
were randomly assigned to each of the eight cells. 


2. Subjects 


The tribal children were from two tribal villages. The villages can be 
called remote because they are not approachable by roads and do not have an 
elementary school or hospital within a distance of five miles. The children 
were illiterate and could never have come across geometrical pattern in print. 
Children between the ages of 6 and 10 were to be chosen, but a correct state- 
ment of age could not always be obtained from parents, Therefore, the 
authors selected as Ss children who had cut at least one new tooth (ap- 
proximate age about 6 years) but who had not cut all of their new teeth 
(about 10 years). 

The illiterate-village sample was selected from a remote nontribal village 
where most children do not go to school and the percentage of illiteracy in 
the adult population is high. The authors would have liked to test urban- 
illiterate children comparable in socioeconomic status to the tribal and rural 
children. But school going is so common in the towns that the authors could 
not find sufficient children to qualify for the desired sample. 


3. Procedure 


As indicated earlier, subjects were tested in groups of five with two pairs 
of stimuli in two experimental sessions 30 minutes apart. In one of the ses- 
sions, Ss were required to discriminate between the elements of a pair of 
stimuli (e.g., parallelogram vs. rectangle) in a continuous session during 
which the pair was presented for a maximum number of 16 trials or until 
S reached a criterion of five consecutively correct responses. A stimulus pair 
for discrimination was presented on a tray with slots, so that 8 could be 
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instructed to pull out the figure that he thought concealed a sweet. $ was 
allowed to collect the sweet and save it until the experiment was over. As 
mentioned earlier, the groups were counterbalanced on the reinforced figure, 
so that half of the Ss received a sweet under, say, parallelogram ; while the 
other, under a rectangle. Ss who had attempted to discriminate between a 
parallelogram and a rectangle were not asked until 30 minutes later to begin 
their second task: i.e., discriminating between a circle and an ellipse. 


C. RrsULTS AND Discussion 


S’s responses were scored in two ways: (a) failure to discriminate within 
16 trials and (b) the number of trials required to reach the criterion of five 
consecutive correct responses. Failure to discriminate was assigned a score of 
16. 

To prepare a table suitable for chi-square analysis, the two conditions of 
reinforcement were neglected; otherwise the expected frequency per cell 
would have been too small. Further, the results of reinforcing the circle, for 
example, do not seem to differ from those when the ellipse was reinforced. 
The total number of failures for the urban, rural, and tribal samples were 
12, 34, and 62 respectively [see Table 1]. With two degrees of freedom, chi 


TABLE 1 
NUMBER ОЕ FAILUnES* 
Urban Rural Tribal 
Type of figure (a) (2) (а) (b) (a) (2) 
"i 3 4 11 6 12 1$ 
United 1 4 10 ў 20 16 


* See text, p. 198, for definitions of Categories (а) and (b). 


square (34.89) is significant beyond the .01 level; therefore, ability to dis- 
criminate geometrical patterns varies directly with the degree of urbanization, 

In a three-by-two-by-two analysis of variance without replication [see (1)], 
the reinforcement-treatment level was ignored because of a large number of 
failure scores. To avoid undue strain on the assumptions of analysis variance 
individual scores were pooled for each cell, and the triple interaction between 
samples (filled vs. unfilled figures) and Categories (а) and (b) was taken 
as the error term. No significant Ё was obtained [see Table 2]. 

The above results suggest that a more sensitive measure, like the number of 
trials required to discriminate the figures, fails to show any difference in the 
Performance of the three cultural groups. But the distribution of the total 
number of Ss failing to discriminate suggests geometrical patterns are more 
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TABLE 2 
SUMMARY OF ANALYSIS OF VARIANCE 
Source df MS F* 

Samples 2 36457. 7.083 
Туре} 1 1200. 
$һаре} 1 363. 
Interaction 

Samples-by-type 2 1027. 

Samples-by-shape 2 2428. 

Type-by-shape 1 33.3 

Samples-by-type-by-shape 2 5147. 
Total 11 86825. 


* Except for the one value given (7.083), no F is larger than one. 

T Filled figure vs. unfilled figure. 

X Category (a) vs. Category (b). See text, p. 198. 
efficiently perceived by children from “civilized” cultures. The level and T" 
not the speed of learning turns out to be a reliable measure for use in the 
crosscultural samples. 
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EFFECT OF DIFFERENTIAL VALUATION ON GROUP LEVEL 
OF ASPIRATION, DECISION TIME, AND PRODUCTIVITY* 


Queens College, Wagner College, and University of Pennsylvania 


Florence Denmark, Dororuy Mourcarroyp, AND ALBERT PEPITONE 


A. INTRODUCTION 


Two aspects of human-aspiration behavior are (а) the level at which goals 
are set by individuals and (2) the persistence that individuals show in setting 
goals for themselves. Both kinds of behavior have been shown to be sensitive 
to social influence: for example, the level at which task goals are set is 
strongly affected by information given to the individual concerning the per- 
formance in that task by groups of known status levels [Chapman and Volk- 
mann (1)]. The persistence of aspiration, in terms of how many trials the 
individual continues to set task goals, is affected by the direction and magnitude 
of the discrepancy between his performance and that of a peer group [Dreyer 
(2)]. In both instances, the individuals can be said to conform to the social 
influence; however, the latter may be considered entirely passive because there 


has been no attempt to exert influence. 

Looking at aspiration behavior as a case of social conformity, a number of 
questions arise analogous to questions of opinion and attitude conformity: for 
example, can aspiration behavior also be affected by active social influence? 
Then, too, active social influence presumably could occur on different levels: 
for example, direct influence refers to the instance in which one individual 
Persuades another to set a goal at a given level. An indirect level of active 
influence might occur by way of an evaluation of the individual’s ability. The 
focus of the present study is on this more subtle social determinant of aspira- 
tion behavior. 

Our working hypothesis was that the level of difficulty of a task goal to 
which an individual aspires, in the absence of any experience with the task, 
is a function of the individual’s relevant ability, as evaluated by a competent 
authority. The second question was whether or not such influence represents 
change in self-evaluation. Clearly, a shift (or difference between conditions) 
in aspiration could be a response that accommodates the source of influence 
only so long as the latter is psychologically present, can mete out punishment 


ts! 
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for noncompliance, or keeps the individual dependent upon him for evaluation. 

Theoretically, a person’s self-evaluation depends upon his ability, as reflected 
by achievement; however, such measurement presupposes either an objective 
score on the task or the opportunity for comparison with other individuals, 
When such standards are lacking, achievement (hence, the individual’s abil- 
ity) is not interpretable. To the extent that achievement approaches an au- 
thoritative evaluation or if achievement cannot be evaluated, one would pre- 
dict acceptance of the influence; that is, self-evaluation should parallel aspira- 
tion behavior, and both would be a function of the external evaluation of 
the individual. 

B. HYPOTHESES 


1. When informed that they have high ability with regard to a particular 
task, groups who lack knowledge about their ability for that task choose 
higher levels of difficulty at which to perform the task than do groups who 
have been told by a competent authority that their ability is average or low. 

2. When informed that they have average ability with regard to a particular 
task, groups who lack knowledge about their ability for that task choose 
higher levels of difficulty at which to perform the task than do groups who 
have been told by a competent authority that their ability is low. 

3. When informed that they have high ability with regard to a particular 
task, groups who lack knowledge about their ability for that task rate their 
own ability to perform the task (self-evaluation) as greater than do groups 
who have been told by a competent authority that their ability is average or 
low. 

4. When informed that they have average ability with regard to a partic- 
ular task, groups who lack knowledge about their ability for that task rate 
their own ability to perform the task (self-evaluation) as greater than do 
groups who have been told by a competent authority that their ability is low. 


C. METHOD 


1. Subjects 


'The subjects (Ss) in this study were 60 pairs of college women enrolled 
in general psychology at the University of Pennsylvania. By participating, 
the students were excused from the preparation of one laboratory report. 


2. Procedure 


Ss filled out a personal-data questionnaire and took a timed spatial-relations 
test during the recruitment session. They were informed that the experimenter 
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they work together at it. The personal-data sheet requested information on 
items such as average grade in high school and college, major subject in college, 
extracurricular activities, the kind of person $ was, and the kind of person § 
would like to work with. Ss were paired at random without reference to any 
of the preceding information. 

4. Experimental sessions. Each pair of Ss met by appointment with E to 
Work on a card-sorting task. At that time, $$ were informed that the par- 
ticular task involved had proved useful in previous studies investigating the 
way in which people reach decisions and work together. The task was that 
that of sorting 35 specially designed cards, permanently mounted on a large 
cardboard. Each card could be classified according to four characteristics: the 
size of the card, the size and density of the dots covering the surface of 
the card, the number of straight lines forming an asterisk-like figure on each 
card, and the pattern of points where each of three heavy black lines reached 
the edges of each card. Ss were to sort each of these characteristics into as 
many different categories as they decided were necessary, working on one 
characteristic at a time. 

Next, Ss were presented with a typed sheet containing presumed levels of 
difficulty for performing the task. They were informed that an earlier study 
had established certain orders of working on the task as more difficult than 
others, For instance, an order that began with categorizing the size of the 
cards was more difficult than one that began with categorizing asterisks, 
Only one sheet was given to the pair of Ss, so that interaction would be in- 
duced by the necessity of the Ss working together, Nine levels of difficulty 
Were typed on the sheet, with No. 1 designated as the most difficult and 
No. 9 as the least difficult. The other levels supposedly were scaled between 
the extremes. To emphasize the presumed scaling of difficulty and to give 
a further basis for discussion, two other scales were presented along with the 
nine levels. E then told Ss that, upon comparing the information on their 
Personal-data sheets and their scores on the spatial-relations test previously 
administered with those of others who had already performed the task, they 
Were judged to have high, average, or low ability. 

Next, Ss were instructed that they had to reach a joint decision about the 
Order or level at which they wished to work, taking into consideration the 
various risks involved. E left the room while Ss reached their decision, and 
the time required was recorded by an observer. When the decision was 
Teached, E returned and started Ss on the task. Ss worked together, telling E 
by number what cards they wished placed in each of the categories they had 


204 JOURNAL OF SOCIAL PSYCHOLOGY 


decided upon. E kept a record of the categorized cards and made various 
observations during the decision making and the work session. The time limit 
for the task was 20 minutes. 

At the end of 20 minutes, a questionnaire was administered to determine 
to what degree the experimental induction had been effective. Questions con- 
cerned self-estimates of ability regarding the task, the amount of attraction 
to the group, and whether or not prior acquaintance with the partner existed. 


D. RESULTS 


Table 1 shows the level of difficulty, decision time, and productivity for 
each of the treatment groups. Analyses of variance yield no Fs significant at 
TABLE 1 


Errects ОЕ ABILITY INDUCTION ON LeveL or DIFFICULTY, Decision TIME, 
AND PRODUCTIVITY 


Treatment 
High Medium Low 

Variable (N=20) (N=20) (N=20) Fe 
Level of difficulty (mean level chosen) 2.1 4.3 3.9 9.17 
Decision time (in seconds) 54.2 88.1 80.9 3.03 
Gross productivity (number of cate- 

gories sorted) 88.5 84.7 92.4 .50 
Net productivity (gross productivity 

minus errors) 70.7 71.9 754 42 
Errors (number incorrectly sorted) 12.8 12.8 17.0 56 


* Evaluation of the significance of the overall variation among conditions is based 
on an F test with two and 57 degrees of freedom. Comparisons between treatments 
are based on two-tailed 7 tests. For level of difficulty, the overall test is significant 
at the .01 level. The difference between the high-ability group and the medium-ability 
group is significant at the .01 level, and the difference between the high-ability group 
and (08 low-ability group is significant at the .02 level. No other difference is sig- 
the .01 level for gross productivity, net productivity, or errors. A significant F 
appears for the choice of a level of difficulty. Individual ? tests give significant 
differences between the high-ability and average-ability treatment groups as 
well as between the high-ability and low-ability treatment groups. 'This same 
pattern tended to occur for decision time, although the difference was not 
statistically significant. . . 

"Table 2 gives the mean scale value for each group on the final questionnaire 
responses and it confirms the experimental induction in regard to ability as 
having been effectively perceived by the Ss (6 < 01). Мо significant differ- 
ences were found between the three treatment groups for liking the other part- 
ner and degree of previous acquaintanceship, because the intention was to con- 
trol these factors. An important result is the valuation the subjects put on their 


m 
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TABLE 2 
MEAN RESPONSE SCORES ON FINAL QUESTIONNAIRE 
"Treatment 
High Medium Low 
Question (N—40) (N=40) (N=40) Fe 
1. Like to work with same partner. 1.9 24 1.6 2.16 
2. Like partner in social situation. 2.8 2.6 2.1 2.10 
3. Self-estimate. 3.1 3.6 3.6 2.38 
4. Experimenter’s judgment. 2.1 3.8 5.0 9.67 
5. How well did you know partner? 5.2 5.0 +7 2.00 


* Evaluation of the significance of overall variation between conditions is based 
on an F test with two and 117 degrees of freedom, Comparisons between treatments 
are based on two-tailed tests. For experimenter’s judgment, the overall test is sig- 
nificant at the .01 level; so is the difference between the high-ability group and the 
medium-ability group, the difference between the high-ability group and the low-ability 
group, and the difference between the medium-ability group and the low-ability group. 


No other difference is significant. 

own ability. No significant differences appeared between the three groups, 
indicating that contrary to the verbal induction of E, Ss themselves took a 
medial position in regard to their ability. 

Rank-difference correlations over all treatment groups between level of 
difficulty and productivity, level of difficulty and self-evaluation, level of diffi- 
culty and decision time, and self-evaluation and productivity were examined 
and reveal a significant rho of .39 between level of difficulty and decision 
time only. Differences in variability between the three treatment groups for 
level of difficulty, decision time, gross and net productivity, and errors were 
also tested. The greatest variability occurred for level of difficulty, but this 


F did not reach the .05 level of significance. 
E. Discussion 


Table 1 confirms Hypothesis 1. Ё'з arbitrary valuation was effective in 
influencing the original decision of the high-ability and medium-ability groups 
in regard to their level of difficulty. 

That the mean of the low-ability group was slightly higher than that for 
the medium-ability group fails to confirm Hypothesis 2 and suggests that 
such influence may be asymmetrical. Ё can influence aspiration upward but 
not downward. Such seems to have been the case in Dreyer 1954 study (2). 

According to level-of-aspiration theory, in the initial choice situation the 
strength of the force to choose high levels of difficulty can be assumed. to be 
dependent on the positive incentive value of success: i.e., the objective öt 
material rewards of success plus subjective feelings of accomplishment if 
successful. Opposing this is the strength of the force to avoid difficult levels, 
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assumed to be dependent on the negative incentive value of failure: i.e., fear 
of failure, plus the extra effort involved in more difficult tasks. Both forces 
may be modified by subjective probabilities of success and failure, which 
might be again assumed to be some function of perceived task difficulty-ability, 
as noted by Lewin (5). However, with no previous knowledge of task diffi- 
culty or ability on the part of the individual or group, the 50-50 probability 
of success and failure then corresponds to the most-probable achievement (6). 
Thus, choice of a level of difficulty should be made in the middle range of a 
series of choices where the probabilities of success and failure are 50-50 under 
these conditions. 

Escalona’s Resultant Valence Theory (6) takes into account the greater 
positive valence of success in this culture, as does Festinger who, in his social- 
comparison processes (3), postulates a unidirectional drive to evaluate abil- 
ities upward in Western culture. Such a drive would act not only as a force 
for $ to choose more difficult levels but also as a restraining force against S’s 
choosing easier levels. In the presence of this additional factor, then, the 
resultant choice of a level of difficulty would be above the point at which the 
probabilities of success and failure are 50-50. 

In the present study, the experimental induction of high ability had the 
effect of reducing the force on Ss to avoid difficult levels because it implied 
that the probability of failure at those levels was minimal. The principal force 
remaining was upward and decision was made accordingly. 

The medium-ability induction appeared to have minimal effect on the forces 
already operative in the situation. Since the medium-ability groups chose a 
level slightly above the middle range of difficulty, the drive to evaluate abil- 
ities upward apparently influenced their decisions to this degree. For the low- 
ability Ss, the results of the induction were not as expected. Ss in this treat- 
ment group chose the highest levels of difficulty as well as those in the middle 
range. Since choice of high levels of difficulty was not true of medium-ability 
Ss, something more than the drive to evaluate abilities upward appears oper- 
ative. The unfavorable evaluation may have created a state of dissonance 
for the individual, as formulated by Festinger (4), and this choice behavior 
may have been an attempt to reduce dissonance. The tendency for anxious 
persons to set either extremely high or very low aspiration levels when easy 
success or no chance for success minimizes anxiety about failure was noted in 
early studies (6). Those low-ability Ss who chose the highest levels appeared 
to behave in a similar fashion. When such Ss were questioned postexperi- 

mentally, it appeared that their choice behavior was defensive in character. 
For instance, they almost always failed to mention the induction as influencing 


—— 
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their choice of level of difficulty; whereas Ss making other choices almost 
always did so. Other Ss in the low-ability groups appeared to have taken the 
experimental induction into account to some degree, but no group chose one 
of the lowest levels, a “floor effect” apparently stemming from the drive to 
evaluate abilities upward. Such Ss, however, would be expected to experience 
increased tension, and this expectation is borne out by the significant correla- 
tion between decision time and level of difficulty. In the case of the high-ability 
group, the induction facilitated their decision; while for the unrealistic low- 
ability Ss refusal to take the evaluation into account also resulted in a quicker 
decision time. 

In regard to gross and net productivity and errors, the productivity of the 
three groups cannot be expected to be influenced by their aspiration if the 
task is above a critical level of difficulty or below a certain level. Clearly, 
if the objective constraints are strong, differences in motivation are irrelevant, 
By the same token, if the task is simple, variations in drive will not show up. 
It is only within a middle range of difficulty that aspiration differences would 
be permitted to be reflected in productivity scores. That the task was neither 
too difficult nor too simple is shown by the wide variation in the scores 
achieved by the Ss. However, from Table 1 it is apparent there is no overall 
difference in productivity across groups. There is a slight decline from high 
ability to medium ability, and a rise of almost eight points from medium ability 
to low ability. The latter again suggests the possibility of asymmetrical in- 
fluence, which in this case is revealed by an extra, compensatory push for the 
Ss who are evaluated low. В | 

Concerning errors, there is also no significant difference among experimental 
conditions, The sharp increase in errors on the part of the low-ability Ss again 


suggests the presence of a compulsive indiscriminate surge. 

In regard to Hypotheses 3 and 4 concerning the Ss’ self-evaluation (i.e., 
the question of the actual acceptance of influence), Question 3 in Table 2 
reveals that, contrary to the hypotheses, there is no overall difference across 
Conditions, Since the questionnaire was administered after the task had been 
attempted, Ss’ experience with the task may have affected their ratings or the 
influence may have been temporary. This former possibility was tested, but 
No significant correlation was found between self-evaluation and productivity. 
The Possibility that self-evaluation may have influenced level of aspiration was 
also examined ; but here, too, no significant correlation between self-evaluation 
and level of difficulty was found. It appears that the Ss self-evaluation did 
not depend upon actual productivity for the task nor was their self-evaluation 
related to level of difficulty. Verbal comments showed Ss at all levels felt 
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little pressure to work for a high score under the motivational conditions in 
this experiment. 


F. Summary AND CONCLUSIONS 


The effect of arbitrary social evaluation as to ability on group level of 
aspiration was studied with 60 college women, who met in pairs with E to 
work on a card-sorting task. Three different levels of ability—high, medium, 
and low—were attributed to the Ss ostensibly on the basis of a spatial-relations 
test and personal-data sheet before Ss jointly chose a level of difficulty at 
which to perform the task. j 

Choice of level of difficulty differed significantly only for the high-ability 
and medium-ability treatments. The level chosen by the low-ability treatment 
groups was slightly higher than that chosen by the medium-ability groups, 
but the difference was not statistically significant. No group chose a level 
at or near the bottom of the scale, but there was a tendency for the low-ability 
treatment groups to be more variable in their choice than either of the other 
two groups. This tendency did not reach statistical significance, but was 
indicative of unrealistic goal setting under the influence of this induction. 

The possibilities that level of aspiration may have been influenced by self- 
evaluation rather than the experimental induction or that actual performance 
on the task may have influenced self-evaluation were checked, but no sig- 
nificant correlations were found. A significant correlation between the diffi- 
culty level selected and the amount of time taken for decision appeared over 
all groups. Those Ss who chose higher levels of difficulty required less time 
for decision, regardless of the induction. No differences appeared between the 
three treatments for gross or net productivity, number of errors, or attraction 
to the group. 

From this study it may be concluded that social influence affects group 
decision in regard to level of difficulty in line with its evaluation (even though 
arbitrary) when no other information in regard to ability is available only 
if the evaluation is not counter to the cultural drive to evaluate abilities up- 
ward. However, such influence appears noneffective in changing productivity 
at any level when motivation is otherwise minimal. 
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BIAS IN EVALUATING THE PRESIDENTIAL DEBATES: 
A “SPLINTER” EFFECT*#1 


Communication Research Division, Boston University 


RareH L. Rosnow 


А. PROBLEM 


In the time since we have become aware of the possibility that the 1960 
Presidential debates may have swung victory to one side and away from the 
other, many investigators have implied the need for a full consideration of 
pertinent variables if the various psychological effects are to be teased out of 
What has popularly become the ritual of political debating. In the present 
Paper, we shall be concerned with a problem in evaluating future presidential 
debates, more specifically with the role of two variables in biasing evaluation. 

A multiplicity of variables conceivably might complicate a design for such 
àn evaluation and, though much research has already been carried out, the 
impact on the judgmental processes of an observer of such things as his per- 
Sonal commitment to the stimuli he is evaluating, the order of presentation of 
Stimuli to be evaluated, the observer's prior familiarity or unfamiliarity with 
the stimuli, problems in pretesting's dispositionally sensitizing observers to 
Опе or the other stimulus, and a host of other factors have not been worked 
Out in detail with regard to the issue at hand. 

The present paper deals with one of these problems—the effect of a person's 
"commitment" to a political party or to a political candidate on fiis judgments 
of political candidates. One question to be answered is whether politically 
committed observers, if so instructed, may be expected dispassionately to 
evaluate their own and the opposition’s candidate on any given criterion of 
effectiveness, If it can be demonstrated that they are not so able, then any 
evaluative design must have built into it a component to ameliorate such error. 


xc bilis 
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In another context, the problem of personal commitment biasing personal 
judgmental processes has been experimentally investigated by Bass and Dunte- 
man (1). They point out that examples of unrealistic overappraisal or under- 
estimation of nations and of political situations are strewn throughout history. 
They note the United States’ overevaluation of the worth of its collaborators 
in the Bay of Pigs incident, this country’s underestimation of the Japanese 
during World War II, and the overevaluation in 1939 by England and France 
of each other but the underevaluation of their common enemy Germany. By 
experimentally producing a laboratory situation of conflict between allied and 
competing groups, Bass and Dunteman were able to demonstrate empirically 
a consistent underestimation by Ss of their competitors but a generally incre- 
mental valuation of allied groups. Results were consistent with those obtained 
in a similar study by Blake and Mouton (2), and it was the implication raised 
by these findings that led to the exploratory study that is presently to be 
described. 

B. PROCEDURE 


The study was carried out in July of 1963 to determine the effect on an 
observer of his commitment to a political party or to a particular party candi- 
date. The dependent variable was the bias in the observer's evaluations as 
debaters of his own and the opposing party's presidential candidate. Forty- 
nine persons took part in the study (26 males and 23 females). Ages ranged 
from 18 to 52 years, the median was 29. Sixty-three per cent of the sample 
had at least three years of college, and 27 per cent had some graduate training. 
'The sample included professional people, clerical and research assistants, 
administrative people, police officers, and undergraduate college students. 

An experiment was carried out on independent groups consisting of two to 
four persons. $s were seated beside one another in a single row separated 
laterally by approximately two feet and faced a tape recorder. A response 
apparatus was fastened to the arm of each §’s chair. The response apparatus 
was enclosed in a metal box (about the size of a shoe box) into which S could 
place his hand (thus shielding his responses from his neighbors) and grasp 
a lever that could be moved in a horizontal arc. Ss were instructed that moving 
the lever all the way to the left would register an extreme negative reaction ; 
moving it all the way to the right, an extreme positive reaction; and the 
center position would register neutral. The lever was spring loaded to return 
to the center position. Though Ss were led to believe that their responses were 
scored on a continuous scale for intensity of feeling, responses were in fact 
recorded on a discrete scale with five categories of judgment, for each of 

which the duration of reaction time was independently recorded. Reaction 
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was measured continuously in time concurrent with continuation of the stim- 
ulus, and total time was checked by comparing the summed reaction time for 
each $ against an independent total-time counter. This experimental apparatus 
was a modification of the Lazarsfeld-Stanton Program Analyzer. Where the 
Program Analyzer records a dichotomous response—like, dislike—the present 
apparatus permitted a continuous and variable intensity automated response: 
hence, its name, Caviar? 

After Ss had been seated and introduced to E, each 8 completed an 11-item 
paper-and-pencil test designed to measure the intensity of his political party 
commitment and his relative commitment to the two major presidential 
candidates for 1960. A consistent attribution of opinions and preferences in 
a direction favoring either one political party or one political candidate was 
taken as indicating, respectively, high commitment to party or high commit- 
ment to candidate. Inconsistency in the direction of opinions and preferences 
was taken as indicating either commitment lower in the individual's response 
hierarchy or no commitment at all, depending upon the extent of the in- 
consistency. 

Following completion of the questionnaires, E told Ss that they were about 
to hear excerpts from the second Nixon-Kennedy debate of October 7, 1962. 
They were instructed to listen carefully both to the questions asked each of 
the candidates as well as to the candidates’ answers. They were told to 
disregard their own personal feelings and to rate the “quality” of the candi- 
dates’ answers, judging these on the basis of logic, clarity, and conciseness. 
Ss were instructed to regard the overall quality from a debater’s point of 
view, considering whether or not the candidates’ answers hit home, whether 
or not they struck a telling blow against the opposition, or whether or not 
the answers were longwinded, hollow, unclear, or perhaps more laden with 
emotion and demagogy than with sound logic. Did the candidate’s statements 
leave the debater in an indefensible position? Was he now vulnerable to 
attack by the opposition? And so on. | 

After a brief opportunity to practice with the response apparatus, Ss listened 
to an edited tape recording of the second Nixon-Kennedy debate. The tape was 
edited in a manner such that one candidate responded to four questions, fol- 
lowed by a rest period of approximately three minutes (permitting Ё to reset 
the recording apparatus), after which time the answers to four questions by 
the other candidate were presented. The tape was approximately 15 minutes 


2 The present version of the Program Analyzer was developed by the Bureau 
of Social Science Research, Inc., under contract SRE Directorate for Armed 
Forces Information and Education, Department of Defense. 
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long, with about an equal amount of time allotted to each candidate, In order 
to balance out any complicating order effect, half of the Ss heard Nixon first; 
while the other half heard Kennedy first. 

Ss’ responses in each of five categories of judgment— "extremely good," 
"somewhat good," "neither good nor poor," "somewhat poor," "extremely 
poor"— were independently recorded on an apparatus located in a room 
adjacent to the experimental room. Continuous measurements were taken on 
each S, the duration of responses in each of the categories measured to an 
accuracy of .1 second. To simplify later statistical computation and to clarify 
the meaning of scores, Ss’ responses were converted to a five-point Likert-type 
scale; that is, the response "extremely good" was considered as a numerical 
score of five, "somewhat good" as four, and so on. Two evaluation indices 
(EI) were calculated for each § from his independent ratings of the candi- 
dates, yielding an overall rating for each S of the quality of each of the 
candidates as debaters. The evaluation index was adapted from the median 
formula for grouped data. EI was defined as a measure of central value on 
the five-point scale such that above or below that measure the duration of 
reaction time was 50 per cent of the total reaction time. The general formula 
was as follows: EI = 2.5 + [32/2 — (tı + t2) ]/ta where 2.5 is the lower 
limit for one interval in which the central measure usually fell. The symbols 
11, 12, and fg represent categories "extremely poor," “somewhat poor," and 


TABLE 1 
ANALYSES OF VARIANCE FOR COMMITMENT 
Source SS df F 

High commitment to party vs. low com- 

mitment to party 046 1 393 
Nixon vs. Kennedy 525 1 4.487" 
Interaction 

Party commitment X Candidates .179 1 1.530 
Within cells 7.025 60 

Total 7.775 63 
High commitment to candidate vs. low com- 

шїн to candidate 010 1 .084 
Nixon vs. Kennedy 114 1 .958 
Interaction 

Candidate commitment X Candidates 075 1 630 
Within cells 7.134 60 

Total 7.333 63 


* p< 05. 
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"neither good nor poor." X represents the total reaction time, summed for the 


five categories. 
C. RESULTS 


Out of the sample of 49 persons, a cluster of 23 scored high on party com- 
mitment, and a cluster of nine scored low. The scores were significantly dif- 
ferent from one cluster to the other ( < .05), and a two-factor analysis 
of variance was computed from the EJ scores for these people. Each factor 
consisted of two levels—one, high commitment and low commitment to party; 
the other, ratings of Nixon and ratings of Kennedy. These results are sum- 
marized in Table 1. In addition, a two-by-two analysis of variance was com- 
puted from the EI scores for significantly different clusters of Ss scoring 
high and low on commitment to candidate. These results are also summarized 
in Table 1. The relationship between party-commitment and candidate- 
commitment scores was determined by computing a Pearson product-moment 
correlation on these pretreatment questionnaire responses. The scores were 


not significantly related to each other, r = .139. 
D. DiscussioN 


The two analyses of variance summarized in Table 1 indicate a single 
effect significant (beyond the .05 level) for Ss pooled on the basis of their 
intensity of party commitment, but nothing for Ss pooled on the basis of 
candidate commitment. This finding is particularly interesting because from 
à common-sense point of view one could argue that high commitment to 
Party or high commitment to candidate or both could relate to a person's 
being biased in his judgment of candidate performance in a debate. Despite this 
common-sense contention, the data suggest that party commitment might 
operate to bias ratings of candidates; whereas candidate commitment does not. 

This finding is all the more interesting when one realizes that despite 
some overlap between the two subsamples, the overlap is minimal; that is, 
some Ss who were selected on the basis of party commitment were also selected 
for their candidate-commitment scores. Moreover, not only are these sub- 
samples peopled to an extent by different individuals; but the variables that 
are the basis for forming the subsamples bear an almost negligible relationship 
to one another (r = .139). The fact remains, however, that these different 
subsamples did rate the debaters differently; yet the hypothesized effect for 
interaction is not significant. 

To get further insight into the problem, subgroups were formed on the 
basis of Ss’ responses to an item that had asked, “If you had to be affiliated 
with one of the following, which would you choose? Democratic Party, Re- 
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publican Party, Other.” Table 2 presents the relationship of major party 
affiliation and evaluation of debaters. Chi square, 16.45 (with two degrees 
of freedom), is significant at the .05 level. It is apparent that belonging to or 


TABLE 2 
RELATIONSHIP OF PARTY AFFILIATION TO EVALUATION OF QUALITY oF DEBATERS 


Rater’s Party Affiliation 


Relative rating of candidate Republican Democrat 
Nixon rated better than Kennedy 14 2 
Nixon and Kennedy rated as equal 8 5 
Kennedy rated better than Nixon 2 12 


being "affiliated with” a political party bears some relationship to the affiliated 
individual's perception of his own party's presidential nominee. It is inter- 
esting to note that this across-the-board relationship is much different from 
the common-sense relationship originally posited. Evaluative bias was not 
confined simply to highly committed people, but was distributed throughout 
the sample. This suggests that some other independent variable may be re- 
sponsible for the effect. However, equally as interesting is the finding in 
'Table 2 that neither party seems to hold a monopoly on biased perception. 
The number of biased Democrats was about equal to the number of biased 
Republicans. When we take a closer look at these 14 Republicans and 12 
Democrats, we find that they are about as heterogeneous en masse as was 
the overall sample from which they were drawn. "There are professional people, 
clerical and research assistants, some administrative people, police officers, and 
the ubiquitous undergraduate college student. Fifteen of the 26 are male, 
age ranges from 18 to 52, with a median age of 29. Despite the small size 
of the sample and the exploratory nature of the study, there remains the 
implication that the 28 persons may not be so unlike some that may be found 
in a national survey on the effects of some future presidential debate, 

To review, the study demonstrates a significant bias in the way politically 
affiliated people perceive opposing party presidential nominees. Despite having 
been instructed to the contrary, about an equal number of both Democrats 
and Republicans demonstrated this judgmental bias. Most таси 
the finding that the bias could not be related to a single common-sense es 
pendent variable—high commitment. Rather, the effect was noted in a hig! 
proportion of Ss almost across the board. . . 

A study that purports to infer the effects of the presidential debates from 
the “objective” reactions of observers must certainly take steps to ameliorate 
the judgmental bias that has been demonstrated. The point is that the effects 


bae 


RALPH L. ROSNOW 217 


produced by the debates will not be merely a function of the stimulus complex 
(the debates) but will in large measure be related to the characteristics 
of the judges. Consequently, in order to get any evaluative measure by judges 
about the debates, a great deal of descriptive information about the judges may 
be required. The fact that they were randomly selected and that their judg- 
ments were independent of one another may not be enough. Randomization, 
however, does give some assurance that the judgmental error is at least loaded 
comparably into both subsamples of Democrats and Republicans, assuming 
that the size of the subsamples is about equal. Perhaps a more satisfactory 
technique for making allowance for this uncontrolled variable is to determine 
the sampling-error adjustment that needs to be made to correct for judgmental 
bias. In the case where one may assume that ratings (or perhaps objective 
reports of the effects, or behavioral measures of the effects, etc.) of the debates 
are dispersed normally about a central measure, then the analysis of covariance 
should be considered, but what the particular covariance variables must be 
would require additional study. 

One other point needs to be made, and that is that though high commitment 
to candidate yielded no significant effect, the situation might have been very 
different were the debates of the preconvention intraparty type—that is, 
Democrat versus Democrat, or Republican versus Republican—where can- 
didate commitment might have been even more salient for the Ss. Moreover, 
in the present situation, Ss’ involvement was probably considerably more 
reduced than would it have been in 1960, and this suggests that raters might 
have been influenced in some measure by the prestige of Kennedy's office. Sup- 
Port for this interpretation is found in Table 3, which presents the relationship 


TABLE 3 
RELATIONSHIP OF RELATIVE CANDIDATE COMMITMENT TO PARTY AFFILIATION 


Relative Candidate Commitment 


Affiliation preference Nixon None Kennedy 
Republican Party 12 5 7 
Democratic Party 1 3 15 


between ordinally scored candidate commitment and party-affiliation pref- 
erence. With two degrees of freedom, x? = 12.22, and is significant at the 
:05 level. The datum suggests solid commitment from the Democrats, but a 
“splintering” of the Republicans approximately one-third of whom show a 
greater measure of commitment to Kennedy than to their party's nominee, 
This finding is reminiscent of the sleeper effect described by Hovland and 
Weiss (4) in their well-known study on communicator credibility. That is, 
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had the present study been carried out closer in time to the original debate 
the splintering might not have been as great. One might hypothesize that 
where an immediate posttest would have picked up a sharp division on can- 
didate commitment between the two major parties, but solid commitment 
within each party, a delayed posttest would show a tendency in the losing 
party toward a splintering effect. 

This “splinter” effect might be explained in the context of Festinger’s 
theory of cognitive dissonance (3). In the case of the individual initially com- 
mitted to a group or idea that has been granted irrevocable and public ac- 
ceptance by a majority of other Persons, such commitment would fail to 
subject the individual to any cognitive stress. Consequently, he would expe- 
rience no motivational force to alter his initial commitment. However, in the 
case of the individual who is initially committed to a group or idea that has 
been irrevocably and publicly rejected by other persons, the individual’s 
commitment is dissonant with the cognition of that rejection. Such an indi- 
vidual would experience strong cognitive pressures to reduce that dissonance, 
and the dissonance might be reduced by his repudiating his initial commitment 
and becoming more committed to the accepted group or idea. 


E. Summary 


An experiment was carried out to determine the role of “commitment” to 
a political party or to a particular party nominee and the subsequent biasing 
effect on evaluation of the presidential debates. Forty-nine persons were sub- 
jected to measurements of the intensity of such commitments, thereafter 
exposed to excerpts from the 1960 presidential debates while concurrently 
rating the debaters using a modified Program Analyzer, Despite their having 
been instructed to evaluate “dispassionately” the debaters, bias in judgment 
was obtained. The bias was not, however, confined to persons with high 
commitment to party or high commitment to candidate, but was noted in a 
high proportion of Ss almost across the board. Suggestions for ameliorating 
such judgmental error were discussed. 

In addition, a “splinter” effect was reported. That is, two and one-half 
years after election of the Democratic candidate, it was found that while the 
Democratic Ss remained fundamentally unchanged in their commitment to 
the Democratic nominee, approximately one-third of the Republican Ss mani- 
fested greater commitment to the President than to their own party’s nominee. 
This “splintering” of commitment was explained in the context of cognitive- 


dissonance theory. 
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SOCIAL-GROUP MEMBERSHIP, LEADERSHIP, 
AND BIRTH ORDER* 


Addiction Research Foundation, Toronto, Ontario, Canada 


REGINALD С. SMART 


A. INTRODUCTION 


Several attempts have been made to relate affiliative behavior to birth 
order: for example, Schachter (8) has shown that the need for affiliation 
during anxiety is greater among first-born children than among later-born 
Children. Becker and Carroll (1) have suggested that first-born children's 
Ereater need for affiliation leads them to join more social and recreational 
groups and that most groups contain a preponderance of first-born children in 
their membership. The possibility that first-born children have more social- 
group memberships than later-born children deserves study for several reasons. 
The first is that such study could extend our knowledge of the background 
correlates of affiliative behavior; the second derives from our knowledge 
that first-born children, compared to later-born children, demonstrate different 
Personality traits, such as greater dependency in school and peer relation- 
Ships (4); stronger responses to frustrating acts by others (3); and more 
tendencies to anger, intense emotionality, and violent reactions to defeat (5). 
"This raises the possibility that small-group studies or experiments employing 
members of social groups as subjects involve biased samples in terms of birth 
Order and many of the characteristics associated with birth order. 

Becker and Carroll (1) have described some informal data (from a study 
of delinquent gangs) that they think imply that first-born children are over- 
represented among gang members. Unfortunately, Becker and Carroll’s data, 
meant to be only suggestive, do not indicate the expected proportions of first- 
born, later-born and intermediate-born children under the null hypothesis 
of no birth-order difference. A complicating factor is suggested by Sampson's 
study (7), which showed that first-born females are more resistant to social 
influence than are later-born females; however, first-born males are signifi- 
cantly less resistant to social influence than are later-born males. A problem 
With Sampson's study is that the difference for females is significant only at 
the .10 level, and that level usually is то? taken as evidence for the support 
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of an hypothesis; so, among females, later-born children should have numeri- 
cally but not significantly more group memberships than first-born children. 
Among males, the opposite should be true, with first-born children having 
significantly more group memberships than later-born children. One purpose 
of this paper is to present data relevant to the foregoing hypotheses. 


The possibility that birth order and democratic leadership in groups are 
related has apparently never been examined, although several investigations 
suggest such a relationship. If, as some studies indicate (6), group leaders 
reflect the mean attitude of the group with regard to conformity,’ then first- 
born children should be more often elected as leaders than later-born children. 
It is easy to see how the first-born (hence, oldest) child would show leader- 
ship among his siblings, although it might be more autocratic than democratic. 
In addition, Dean (2) and Koch (5) have found that first-born children 
do show more leadership when young. This study predicts that first-born group 
members are more likely to hold elected offices in their groups as a reflection 
of their greater leadership characteristics. 


B. METHOD 


The data for this study were obtained from questionnaires distributed to 
370 students (197 males and 173 females) enrolled in four undergraduate 
courses at the University of Toronto. The students were asked to list the 
number of older and younger siblings: living as well as deceased. Siblings 
were taken to contribute to the student's ordinal position only if they had 
lived for the first five years of the respondent student's life. The students 
were also asked to list the social and recreational clubs to which they be- 
longed, together with any elected offices they held. Social or recreational ac- 
tivities per se were not counted unless they occurred in connection with some 
formal club membership. In addition, appointed positions were not counted 
as evidence of leadership. 


C. RESULTS 


All subjects were categorized as only children, first-born children, inter- 
mediate-born children, or last-born children. Further, each student was cate- 
gorized as having no club memberships or one or more memberships. The data 
from first-born children and only children were combined to form a first-born; 
those for intermediate-born children and last-born children were combined into 


a later-born group. 
The numbers of first-born males and later-born males who had one or more 
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TABLE 1 
THe NUMBER oF First-BorN MALES AND LATER-BoRN MALES 
AND GROUP MEMBERSHIP* 


Group memberships 


Birth order One or more None "Total 
First born 57 34 91 
Later born 51 55 106 

"Total 108 89 197 


* Chi square = 4.239, df = 1, р < .05. 


group memberships and those who had no memberships is shown in Table 1. 
The chi square, 4.239, is significant at the .05 level and shows that more first- 
born males are group members than are later-born males. Comparable data 
for females are shown in Table 2 and, in keeping with expectation, later-born 
females are more often members of social groups than are early-born females, 
but the difference is not significant (chi square = .205, p > .80). 

TABLE 2 


Tue Number oF First-BorN FEMALES AND LATER-BORN FEMALES 
AND Group MEMBERSHIP 


Group memberships 


Birth order One or more None Total 
First born 38 54 92 
Later born 37 44 81 

Total 75 98 173 


* Chi square = .205, df = 1, $ > .90. 

Each student who had one or more group memberships was characterized 
as holding or not holding an elected office. Since no differential prediction based 
On sex was made, the data for males and females were combined. The numbers 
of group members who held office and who did not hold office are shown in 
Table 3. Since the chi square is nonsignificant (chi square = .696, р > .80) 
there is no indication that leadership is associated with birth order. 

TABLE 3 


THe Numser оғ Group Members WHo Herp AND Wxo Dip Nor 
Hor» ELECTED OFFICE* 


Held Did not hold 
Birth order elected office elected office Total 
First born 29 66 95 
Later born 22 66 88 
Total 51 132 183 


* Chi square = .696, df = 1, $ > 80. 
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D. Discussion 


The results support both predictions for affiliative behavior. First-born 
males more often than later-born males are social-group members, but there 
is no birth-order difference for females. This finding adds further support to 
the proposition that first-born males have greater affiliative needs than do 
later-born males and supports Sampson’s findings (7) and Becker and Car- 
roll’s expectations (1). This result suggests that social groups composed 
wholely or mainly of males contain a preponderance of relatively dependent 
persons who are sensitive to social influence. This assertion, which of course 
bears further investigation, could seriously limit the generality of many socio- 
psychological investigations using members of social groups. 

The failure to obtain a significant birth-order difference for females is 
in line with the expectation based on Sampson's findings (7). Affiliative be- 
havior seems to be associated with birth order only in males; however, later- 
born females might demonstrate affiliative behavior in ways different from 
those of first-born males. Fewer females than males, regardless of birth order, 
were members of social groups; and it may be that female affiliative needs are 
more often met by informal social groups. Perhaps greater birth-order differ- 
ences would be found for females if questions were asked about the number of 
evenings spent alone, the number of times they go to classes alone, or about 
other less-formal social events. 

The independence of birth order and leadership in groups is contrary to 
the prediction based on studies of young children. Perhaps the first-born child's 
early leadership behavior is lost by the late teens or perhaps their leadership 
is more autocratic than democratic—as implied by holding elected offices. 
Further studies of the development of leadership tendencies and of different 
types of leadership are needed before these questions can be satisfactorily 
answered. 

E. SUMMARY 


‘The aim of this study was to relate social-group membership and leadership 
to birth order. Based on studies of conformity and affiliative behavior it was 
predicted that first-born males have more social-group memberships than do 
later-born males, but no significant birth-order difference was expected of 
females. First-born children were expected to be more often chosen for elected 
offices (indicative of democratic leadership) than are later-born children. 
'These hypotheses were examined by collecting relevant data from under- 
graduate students. It was found that first-born males do have more social- 
group memberships, whereas no difference obtains for females. The predicted 
dependence of leadership on birth order was not found. 
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EFFECT OF DISCREPANT COMMUNICATIONS ON 
SELF-PERCEPTION* 1 


Department of Psychology, University of Texas 


DEMETRIOS РАРАСЕОКСІЅ AND Brian M. McCann 


A. INTRODUCTION 


It is becoming increasingly apparent that the variables studied in opinion 
and attitude change may be especially relevant to the study of personal- 
influence processes that characterize in part counseling and psychotherapy. 
In the past, authors of various psychotherapeutic systems and procedures that 
range from advice to prolonged intensive psychotherapy have tended to rely 
on “clinical” intuition and observation in order to understand their successful 
and unsuccessful attempts at communication. Some unfortunate consequences of 
this, at best, partial approach have been discussed by Eysenck (5). More re- 
cently, conceptualizations of psychotherapy as a learning process (2) and direct 
attempts to relate the learning and acceptance of self-relevant information (3) 
have appeared in the literature. In addition, several practitioners [e.g., Ellis (4), 
Ruesch (12a), Wolpe (14)] are now relying in various degrees on methods 
of treatment directly related to experimentally studied variables of learning 
and of attitude change. Furthermore, Frank (8) has written a well-docu- 
mented account of the close connection between persuasion and personality 
change. . 

Of special relevance to the personality-change area is the effect of persuasive 
communications aimed at changing a person's existing perceptions of his 
Personality and self. Bergin (3) found an interaction between communicator 
credibility and discrepancy between communication position and the subject's 
prior self-appraisal on the masculinity-femininity dimension, In this study, an 
extremely negative message produced most change if the communicator was 
of high credibility and least change if his credibility was low. Although 
Bergin's results provide an excellent demonstration ofa prediction based on 
dissonance theory (6), it seems desirable that other personality traits and 
other communication variables should also be investigated. The experiment 
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to be reported introduces another personality dimension and attempts’ also 
to investigate the efficacy of positive communication as well as that of negative 
communication. 

Although the physical presence of a communicator is a standard procedure 
in counseling and psychotherapy, the experimental assessment of the precise 
communicator effect presents difficulties. The use of two different communi- 
cators [see Bergin (3)] appears to be an adequate solution, but it should be 
noted that a perceived difference in credibility is only one of a great number 
of ways that two people, physically present, may be perceived as different 
from each other. On the other hand, the presence of the same communicator 
acting different parts creates perhaps more serious difficulties in an experi- 
mental design. A possible step toward an unambiguous demonstration of com- 
municator effects is to remove the physical stimulus of the communicator and 
to insure that, at least as distal stimulus, the source of the communication re- 
mains fairly constant for all subjects. 

'The study reported here was designed to investigate the effects of a dis- 
crepant communication on a person's self-perception, holding the source of 
the message as constant as possible. A "personality" test, suitable for group 
administration and self-scoring, was so constructed that the vast majority of 
subjects would obtain a very "high" score on one scale. For some subjects, 
the high-scoring scale was labeled "hostility"; while for other subjects, the 
high-scoring scale was labeled “masculinity” (for male subjects) ог "femi- 
ninity" (for female subjects). Thus, by means of a test described as stan- 
dardized and valid, an attempt was made to hold the credibility of the source 
constant for all subjects; also to assess the effect of self-relevant com- 
munications of both negative and positive type. 

Specifically, the authors predicted (a) that a discrepant communication 
of the type just described results in self-evaluations in line with the direction 
of the communication, and (b) that a negative or less socially desirable com- 
munication finds greater acceptance than does a positive communication. Pre- 
diction (2) was derived from dissonance theory (6), since a negative com- 
munication is expected to create greater dissonance (hence, greater pressure 
to reduce the dissonance). Alternatively, it may be argued that self-evaluations 
of positive characteristics are more resistant to change than are similar eval- 
uations of one’s negative characteristics. In all likelihood, the latter are 
seldom included within the self-concept; consequently, if attitudes about 
negative characteristics are relatively hazy and uncrystallized, they may be 
more vulnerable to persuasive communication. 

It should be pointed out that even though the relevance of the present ex- 


x 


mA аллы: 


DEMETRIOS PAPAGEORGIS AND BRIAN М. MC CANN 229 


periment to some aspects of counseling and psychotherapy seems obvious, 
it is also true that by no means does it come close to an experimental analogue 
of either. Rather, the study reported represents a highly restricted investigation 
of a limited number of variables that may operate in communication situations 
when changes in self-perception are advocated. 


B. METHOD 


Ап attempt was made to change the subjects' self-perceptions of a positive 
personality characteristic (masculinity or femininity) or a negative personality 
Characteristic (hostility) by means of an "extreme" communication con- 
cerning one of them. The experimental groups were given a self-scoring 
group "personality" test so designed that each subject would score close to 
average on three test scales and extremely high on a fourth scale. This fourth 
scale, the Deviant Scale (D), was labeled “masculinity,” “femininity,” or 
“hostility” depending on the experimental condition. The post-test self-ratings 
of the experimental groups on the manipulated characteristic were compared 
to those of the control group who had also been given the test but who had 
not scored it; hence, received no communication. Thus, the design conformed 
to the so-called after-only design [Hovland, Lumsdaine, and Sheffield (10)] 
averting the danger of precommunication sensitization and enhancing the 
likelihood of acceptance of the stated purpose of the experiment at face value. 


l. Subjects 


Eighty-eight male and 78 female undergraduates enrolled in an introduc- 
tory-psychology course at The University of Texas served in the experiment. 
While all students in the course are required to serve as experimental subjects, 
Participation in any given experiment is voluntary. Groups of subjects were 
randomly assigned to one of the following experimental or control conditions: 
(a) masculinity deviant (N = 30) or femininity deviant (N = 22), (р) 
hostility deviant (30 males and 26 females), (c) control group (28 males 
and 29 females) ; i.e., subjects who took the test but did not score it. 

In addition, 17 male subjects and 1 female subjects drawn from the 
Same student population supplied semantic-differential ratings [see Osgood, 
Suci, and Tannenbaum (12) ] of the characteristics used in the communications 
in order to establish how these were understood and evaluated among the 
students, 

2. Materials 

A 107-item mimeographed “personality” test composed of items taken from 

the Minnesota Multiphasic Personality Inventory was given to the two experi- 
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mental groups and to the control subjects. Answers to the test were recorded 
by each subject on an answer sheet. Later, the experimental subjects received 
four specially constructed scoring stencils and a profile sheet on which they 
were to plot their scores from the four scales. The scales consisted of the Lie 
Scale (L), the Correction Scale (К), and the Psychopathic Deviate Scale 
(Pd) taken intact from the MMPI and scored in the usual way, and the 
contrived Deviant Scale (D). The last scale consisted of 25 MMPI items 
that had been answered in one direction only by more than 95 per cent of 
the males and females in the MMPI college-standardization sample. The 
scoring stencil for the contrived D Scale was keyed, so that on 21 of the 25 
items the most-common response contributed to the total raw score for the 
scale. The authors expected the vast majority of the subjects to score high 
on the D Scale and close to the mean on the remaining scales. 

The profile sheet was a simplified MMPI profile reproduced by mimeo- 
graph. At the bottom of the profile were the names of the four fictitious 
scales, appearing in a constant order for all experimental groups and called 
“Masculinity” or “Femininity,” "Creativity," “Social Adequacy,” and “Hos- 
tility.” All subjects were instructed to write the score obtained from the 
MMPI Lie Scale under “Creativity” and that obtained from the Correction 
Scale under "Social Adequacy.” For the masculinity-deviant and femininity- 
deviant groups, the D Scale scores were labeled “Masculinity” or “Еетіліп- 
ity"; the Psychopathic Deviate scale scores, “Hostility.” The reverse of this 
labeling was used with the hostility-deviant groups. As with the MMPI 
profile sheet, raw scores on each of the four scales were located on the center 
of the sheet; and the corresponding centile scores were determined from a 
conversion scale at each side. A solid dark line drawn at the 50th centile 
point indicated the mean score for the “test,” while dotted lines along the 
35th and 65th centiles were labeled as including the “average” range of 
scores for each personality characteristic. 

The experimental measure consisted of self-ratings of five personality 
characteristics described as self-satisfaction, masculinity or femininity, number 
of problems, hostility, and adjustment. Nine statements were presented for 
each characteristic, and the subject rated himself in comparison to the “average 
person” (“ауегаре male” and “average female” were substituted a the case 
of masculinity and femininity). The statements ranged пип I аш ex- 
tremely more than the average person” (Score 9) to I am extremely 
less than the average person” (Score 1). Each subject was asked to 
indicate what statement he considered most acceptable and most objectionable 
to him as well as his latitudes of acceptance and rejection [see Hoyland, 


Harvey, and Sherif (9)]. 
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3. Procedure 


Subjects, randomly assigned to hostility-deviant, masculinity-deviant or 
femininit y-deviant, and control-group conditions, were tested in mixed-sex 
groups of 10 to 20. The subjects were spread out in a large classroom in an 
attempt to preclude their seeing any test results but their own because it was 
anticipated that the test scores would be similar and surprising. 

Subjects were told that they were being tested as part of a nationwide 
Project designed to determine some personality characteristics of American 
college students. Both the contrived test, which was named the “California 
Polyphasic Personality Inventory” and the experimental measure that followed 
were presented as part of this testing. The former was described as a valid, 
standardized, and reliable instrument for which national norms were available, 
The students were also told that each test item had predictive validity, even 
though some items lacked face validity (a brief explanation of the technical 
terms was also given). When all the experimental subjects in a group had 
finished marking their answer sheets, the test booklets were collected; and 
the scoring stencils and profile sheets were distributed with instructions for 
scoring, plotting, and interpreting centile scores. This part of the procedure 
Was omitted in the control groups. А 

The second “test” (the experimental measure) was given after the subjects 
had been instructed to put aside the other materials. Detailed instructions 
were given. А 

Each group session was supervised by the same two experimenters (one, a 
male psychology faculty member; the other, a male graduate-student assistant), 

nstructions were always given in a standard fashion by the first-mentioned 
experimenter. The total time required for experimental groups averaged 
35 minutes, The subjects responded anonymously. 

At the end of the experiment, the subjects were informed of the true nature 
and purpose of the study, and were especially cautioned that the results 
Obtained on the contrived test were in no way related to their actual standing 


9n the four characteristics. 
C. RESULTS 
1. Semantic-Differential Ratings of the Characteristics 


It will be recalled that semantic-differential ratings of the characteristics 
Used in the experiment were obtained from 28 male and female subjects 
(drawn from the same student population), who did not participate ли the 
experiment proper. The main purpose of these ratings was to verify the 
assumption that hostility represented a negative trait and that masculinity 
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and femininity were viewed positively. The subjects were asked to rate 
several concepts including “hostility,” “а man who is very hostile,” “а girl 
who is very hostile," “masculinity,” “а man who is very masculine," "femi- 
ninity,” and “a girl who is very feminine.” The concepts were rated on 15 
bipolar scales, five of which represented the dimension of understandability 
[see Nunnally (11)], with another five based on the probabilistic or “belief” 
dimension, and the remaining five relevant to the evaluative or “attitude” 
dimension [see Fishbein and Raven (7) ]. 


» a 


э» 


The results of immediate interest were obtained оп the evaluative (e.g., 
good-bad) dimension of the concepts. Hostility and its related concepts re- 
ceived uniformly low evaluations from both male and female subjects, the 
mean scores ranging from 2.85 to 3.67 on the seven-point semantic-differential 
scale. The concepts related to masculinity and femininity uniformly received 
positive evaluations, the mean ratings ranging from 5.13 to 5.87. The mean 
differences between evaluations of hostility and masculinity or femininity 
concepts are significant at the .01 level; thus supporting the assumption that 
the experimental subjects did indeed receive positive communication (mascu- 
linity or femininity) or negative communication (hostility). 


Differences similar to those found with the evaluative ratings were also 
observed on the understandability dimension. Here, too, the hostility concepts 
were rated as significantly less understandable (5 < .01), less familiar, and 
more complex than either masculinity or femininity. Finally, the belief di- 
mension (e.g., possible-impossible) yielded uniformly high scores on all con- 
cepts, with a slight tendency for the positive concepts to receive higher ratings. 
On all three meaning dimensions sex differences were observed, but they were 
not consistent. If anything, female subjects tended to rate lower than males. 


2. Success of the Experimental Manipulation 


The experimental subjects plotted their scores from the four scales on 
the contrived self-scoring test on a profile sheet. It was intended that each 
subject would receive a high score on the D Scale and approximately average 
scores on the remaining three scales. Since the experimental manipulation was 
to some extent under the subjects’ control, it is necessary to show that the 
experimental subjects did, as was intended, receive discrepant high scores on 
the test profile. Only one female subject, who misused the scoring stencils, 
received a score below the 65th centile on the D Scale; so she was excluded 
from the results summarized below. The mean centile scores for the various 
experimental groups ranged from the 84th to the 86th centile. Jn contrast, 
the corresponding mean centile scores for the same characteristics in the 
nondeviant communication conditions (i.e, from the MMPI Psychopathic 
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Deviate Scale) ranged from the 35th to the 39th centiles, below the mean 
but still within the average range indicated on the profile. Average scores 
were also obtained on the remaining two noncritical scales. 


3. Effect of Communications on Self-Ratings 


Following the scoring of the test and the plotting of the profile, or taking 
the test only for the control subjects, the students rated themselves on per- 
sonality characteristics, including those experimentally manipulated. These 
ratings allowed comparisons of experimental and control subjects on what 
statement they chose as most acceptable and most objectionable, as well as 
on measures of their latitudes of acceptance and rejection. The latitude of 
acceptance was defined as the most-acceptable statement plus all other state- 
ments marked as acceptable; the latitude of rejection, as the most-objectionable 
statement and all other statements considered objectionable. Two measures 
were computed for each latitude: its size or the number of statements it 
included and its mean. 

As expected, the most-acceptable-statement rating proved to be the most 
useful of the foregoing indices [see Bergin (3)]. The results for this measure 
with experimental and control groups are shown in Table 1. None of the mean 

TABLE 1 


MosT-ACCEPTABLE SELF-RATINGS FOR THE EXPERIMENTAL AND CONTROL GROUPS 
ON THE TWO PERSONALITY CHARACTERISTICS 


Male Female xni 
inity Femininity hostility hostility ale Female 
770 deviant deviant deviant control control 
Parameter (N=30) (N=22) (N=30) (N=25) (N=28) (N=29) 


Masculinity or 
femininity 
Mean 6.47 5.32 5.43 5.04 5.75 5.34 
SD 141 1.52 1.07 1.06 1.17 0.85 
Moan 4.17 3.95 543 4.96 4.04 414 
SD 1.78 1.56 1.68 1.69 145 1.38 


ratings of the characteristics that received average profile scores differs sig- 
nificantly from the corresponding mean ratings obtained from the control 
subjects. According to Table 1, ratings of hostility by the masculinity-deviant 
group are not significantly different from those made by the masculinity- 
control group, the same being true of the femininity-deviant and femininity- 
control hostility ratings. Likewise, masculine hostility-deviant subjects do not 
differ from masculine-control subjects in masculinity ratings, nor do feminine 
hostility-deviant subjects differ from feminine-control subjects in femininity 
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ratings. Corresponding product-moment correlation coefficients between com- 
municated centile score and most-acceptable self-rating are as follows: 0.50 
(5 < .01) for males and 0.47 (p < -05) for females on hostility, —0.14 
(n.s.) for the masculine hostility-deviant group on masculinity, and +-0.20 
(n.s.) for the feminine hostility-deviant group on femininity. The discrepancy 
between the significant hostility correlations and the nonsignificant masculinity 
and femininity correlations is probably due to differences in the traits them- 
selves. It is also possible that the hostility communications were closer to the 
subjects’ initial position, while the masculinity and femininity communications 
advocated a more discrepant, or lower Position, as may be seen from the con- 
trol data. 

Table 1 shows that not all communications were successful in producing 
higher self-ratings. Hostility-deviant communications for male subjects re- 
sulted in higher mean most-acceptable ratings than those expressed by the 
controls (p < .01). The corresponding communications for female subjects 
also resulted in higher ratings (p < :06), but the magnitude of the difference 
was smaller. Masculinity-deviant communications resulted in ratings that 
were slightly higher than those of the control-male subjects (5 < .05), 
whereas femininity-deviant communications produced no significant changes 
in female subjects. The corresponding correlation coefficients between com- 
munication centile scores and most-acceptable ratings are as follows: male 
hostility-deviant subjects 0.12 (n.s.), female hostility-deviant —0.27 (p< 
:01), male masculinity-deviant subjects —0.11 (n.s.), and female femininity- 
deviant subjects, —0.02 (n.s.). 

The other measures obtained from the experimental questionnaire did not 
yield results that were as meaningful as those just reported. In terms of most- 
objectionable ratings, only the male hostility-deviant subjects showed a sig- 
nificant tendency (5 < .01, by Fisher’s exact probability test) to reject more 
often the extreme low rating on the trait than did the controls. No significant 
differences were obtained in the sizes of the latitudes of acceptance and rejec- 
tion, although the findings consistently suggested that the communications 
were effective. The mean latitude of acceptance and rejection scores contained 
several significant effects which followed the results obtained with the mean 
most-acceptable ratings. It should be noted that this last measure is not inde- 
pendent of the most-acceptable and most-objectionable ratings. 


D. DiscussioN 


The first hypothesis, whch predicted self-evaluations in line with the dis- 
crepant communication, receives unambiguous support only in the case of 


y -— 
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male subjects. Female students also rated themselves higher than the controls 
after learning their deviant scores on hostility, but the level of significance 
( < .06) is not high. A conservative conclusion is that the first hypothesis 
is supported for male subjects with both types of communication, while female 
subjects show a tendency to be affected only by the negative, hostility com- 
munication. 

The second hypothesis, which predicted a greater degree of acceptance of 
the negative communication is supported in the case of females. No clear 
support is available for male subjects, although inspection of the means possibly 
indicates greater acceptance of the negative communication. 

"The authors assumed that a communication that indicates a high standing 
on hostility is negatively evaluated by the subjects and that a communication 
that indicates a high standing on masculinity or femininity is positively eval- 
uated. This assumption received support from the evaluative ratings of the 
concepts obtained from subjects who did not serve in the main experiment; 
however, the positive-negative evaluative dimension was not the only dimension 
on which differentiation of the personality concepts was possible: a similar 
relationship was found on the understandability dimension. Thus, although 
the distinction between positively and negatively evaluated communications 
may account for the findings, it is also possible that the degree to which a 
concept is understandable to the subject or has specific and precise meaning 
for him determines his persuasibility. The less-understandable or less-specific 
Concept could be more susceptible to influence [Asch (1), Festinger (6), 
Sherif (13) ]. It remains for future research to elucidate the role of this com- 
Ponent of meaning by comparing, under the same conditions, explicitly defined 
Personality traits to trait names without explicit definition or supplied with 
only a vague definition. Attitude change may be more likely to occur when 
the object of the attitude is capable of plausible redefinition by the subject. 

The difference in susceptibility between males and females observed in this 
experiment remains a matter of speculation. The lesser susceptibility of females 
may be a cultural sex characteristic or may indicate a greater likelihood for 
female subjects to discount test information. The authors favor the hypothesis 
that femininity occupies a central position in the self-concept of females in 
Our culture and may represent an attitude that is not only a matter of con- 
cern to the individual but is also stable enough to resist change, In an early 
Part of this paper the authors advanced the notion that positive characteristics 
in general form a more basic part of the self-concept than do rue charac- 
teristics, The relatively small change observed in the masculinity dimension 
is also in line with this hypothesis; so it will be of some interest to see if the 
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significant masculinity effect obtained in this study holds in future replications. 

Another interesting question for future research concerns the effectiveness 
of the group-administration conditions used in the present study. It appears 
that group conditions, with the Possibility of overt subject reaction, makes 
the discounting of the validity of the communication easier for the subjects. 
At least this seems to be the case when highly negative messages are used. 
In a subsequent experiment, 30 male subjects were given very high scores on 
femininity and 23 female subjects obtained very high scores on masculinity, 
The overt reaction of these subjects to the communication indicated that group 
administration is not suited to highly discrepant negative communications of 
this type. On the other hand, with less-extreme messages, group conditions may 
facilitate a perception that the subject is undergoing a bona fide personality 
assessment. The self-scoring procedure may contribute further to this per- 
ception. 

E. SUMMARY 


Male and female college students took and scored a contrived “personality” 
test that resulted in extreme scores оп a positive personality characteristic 
(masculinity or femininity) or a negative personality characteristic (hostility). 
Subsequent self-ratings showed acceptance of both communications by the 
male students and a tendency by females to accept only the negative message, 
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DIAGNOSTIC USE OF THE INTERACTION PROFILE* 


Harvard University and Haverford College 


Ковевт F. Bates AND A. PAUL Hare 


A. INTRODUCTION 


Since the publication of Interaction Process Analysis [Bales (1)], the 
method described there for the observation of social behavior has been used 
by a number of different investigators. Experience has accumulated to the 
Point at which some useful hints may be given to the investigator who has 
taken observations of only one or a few groups and wishes to know what 
may be inferred about their peculiar characteristics. The present article sum- 
marizes the results obtained by different investigators in such a way as to 
Provide a frame of reference for this sort of diagnostic use. 


B. A REFERENCE PoPULATION OF PROFILES 


The population of profiles obtained by investigators who have used the 
method in a standard way for a given sort of group, task, or unusual con- 
dition provides a frame of reference within which any one profile gains added 
meaning. One may understand more about the particular situation from which 
his profile was obtained by discovering what other kinds of situations have 
given similar or different profiles. 

In its raw form, an interaction profile is a frequency count of the number 
of scores that a given observer has entered in each category of the total set used 
for observation. The present system uses 12 categories, as shown in Table 1. 
Because scoring proceeds continuously throughout the course of the interaction 
Observed, the raw number of scores obtained depends heavily upon the length 
of time. If chronological time has been measured, rates per unit of time for 
individual categories or for the total may be computed. Total interaction 
rate on a base of chronological time is a meaningful measure. Unfortunately, 
actual time rates are seldom reported; consequently, for comparisons across 
studies one must fall back on rates that are computed as percentages in each 
Category of the total number of scores for all categories. We have reduced all 


Profiles used in the present study to percentage rates. 
Similarities and differences in profiles are attributable in part to peculiarities 


ccc ANN 
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of observers. In deciding what should constitute a single profile in the refer- 
ence population it has seemed best to take only one summary profile from 
each study, so that the variance over profiles should most fairly represent the 
variance attributable to differences of investigators as well as to the situations 


they observed. 

An interaction profile may be tabulated for a given subject vis-à-vis one 
or more others or for any summation of subjects. According to diagnostic 
needs, one might want a reference population when the single profile is the 
profile of a single individual or he might prefer the summary profile of sets 
of individuals with similar characteristics, such as sex or age, or he might 
prefer the single profile to be the summary of all individuals in a given inter- 
acting group, regardless of differences in characteristics. An example would 
be a summary profile of therapist and patients together in a therapy group. 
Finally, a number of groups observed under similar conditions might be 
added together to produce the single profile. We have taken this final alter- 
native, 

A single profile in our reference population usually represents a whole study 
by a single investigator, but there is one exception. In the laboratory studies 
at Harvard, a standard diagnostic task has been used for a number of studies 
by different investigators. For present purposes, all of the Harvard studies 
using this task have been summarized in a single profile to avoid overrepresen- 
tation of this task in the reference population; thus the profile we have taken 
as the single case for the reference population is either a summary profile for 
à given study or a series of studies under the same conditions. This procedure 
averages out the effects of all sources of variance internal to a given study. 
The gain, however, is that usually we are able to relate differences in profiles 
to major features of the situation, task, type or condition of subjects that have 
been described by the investigator. : 

Table 1 presents 21 summary profiles available at the time of writing 
(1963) to make up a reference population. Each profile is shown as a 
column of percentage rates in each category on the base of the total number 
Of acts observed in the given study. А new profile may be placed in relation 
to the reference population by comparing it with each of the 21 until one is 
found which it most closely matches, but this is a laborious procedure. It 
would be easier to locate the peculiarities of a new profile if one had some 
idea of how far above or below the mean a given category generally has to 
be before we are able to connect it with some definite peculiarity of the situa- 


tion, task, or type or condition of subjects. 
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C. Means AND VARIANCES IN THE REFERENCE POPULATION 


The method for determining diagnostic cutting points usually involves 
computation of the mean and variance of each of the variables in the profile, 
but the present case presents special complications. The value of a given 
variable on a given profile in our population is a percentage rate. Unfortu- 
nately, percentage rates are not normally distributed within a sample drawn 
from a population of percentage rates. For rates of the magnitude obtained 
with the present method, the lower the mean rate the more skewed is the 
distribution because variation below the mean is restricted by the lower 
boundary of a zero rate; but variation is not restricted to the same degree 
above the mean. Fortunately, there exists a transformation that tends to 
normalize distributions of percentage rates: namely, the arc-sine transforma- 
tion [Snedecor (22) ]. For the computation of the variance in each category, 
we have applied this transformation to the percentage rates. 

Because the conversion of a profile of percentage rates into arc-sine equiva- 
lents involves appreciable additional labor, it is convenient for diagnostic use 
(once the variances have been computed and the points of practical interest 
above and below the mean have been located) to have a table that states 
the mean and cutting points in percentage form. Table 2 gives the mean for 
each category and shows the points of the distribution that fall one standard 
deviation and 1.96 standard deviations above and below the mean. 


TABLE 2 
MEAN PERCENTAGE RATE IN EACH CATEGORY OF THE INTERACTION PROFILE AND CUTTING 
Points ABOVE AND BELOW THE MEAN FoR THE DIAGNOSTIC REFERENCE POPULATION 


Percentage rates Mean per- Percentage rates 

Interaction-process below the mean centage above the mean 
analysis category 1.96 SD 1.00 SD rate* 1.00 SD 1.96 SD 

1. Shows solidarity 04 74 2.97 5.10 8.50 
2. Shows tension release 24 240 8.17 13.90 22.20 
3. Shows agreement 1.80 5.00 10.70 16.30 23.60 
. Gives suggestion 0 77 6.56 12.40 22.20 

Н Gives agian 7.40 13.50 22.24 30.60 40.20 
6. Gives information 9.90 17.90 28.72 39.60 51.10 
7. Asks for information 1.20 2.90 5.88 10 1 
$. Asks for opinion. 4:28 1.20 sar 0 530 
9. Asks for suggestion 0 03 . = i 
10. Shows disagreement r PU Ed 6.00 10.20 
11. Shows tension , 02 21 241 4.40 8.20 

12. Shows antagonism 0 = - 


* The means for each category are obtained directly by pic the original 
percentage rates, rather than by retransformation back from tl e are ne mona рш 
to the nature of the transformation there are small losses in transforming | ack Ra lee 
than leave the user with the uncomfortable feeling that there ie some mistake in 
table, we have avoided the transformation for the mean values shown. 
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What the user of Table 2 wants, we shall assume, is to be able to take a raw 
profile in which he has a special interest, to standardize it by computing 
percentage rates and (by comparing his profile with the mean rates and the 
cutting points on Table 2) to locate those particular rates on it that are un- 
usual enough to merit special explanation. Then he may want to refer to 
Table 1 to see in what kind of condition unusual rates of this kind have been 
observed before. The choice of cutting points above and below the mean for 
locating "unusual" rates in this sense is essentially a pragmatic matter. From 
our experience, we believe that points one standard deviation above and 
below the mean on each category may prove generally useful for diagnostic 
inspection and clinical interpretation of single profile. 


D. PROFILES FROM PARTICULAR STUDIES 


We shall now examine briefly, one by one, the profiles in the reference 
Population and the conditions under which they were obtained. This review 
will allow the reader to form an impression of the usefulness of the cutting 
Points suggested and will provide him with an annotated bibliography of the 
studies from which he can obtain a more complete description of the condi- 
tions, If most of the rates appearing outside the suggested cutting points prove 
to be plausibly related to major features of the situation, task, type or con- 
dition of subjects as described by the investigator, we shall believe that the 


Points have been well chosen. 
1. Bales’ Case-Discussion Problem 


A group of students, usually five in number, unknown to each other, ob- 
tained through the employment office or by writing to a random sample, 
meet in a laboratory room with microphones and one-way mirror which are 
explained to them as they enter (2, 3, 4, 19). They are not introduced to 
each other by name, and no leader is appointed. The task of each session is 
the discussion of a human-relations case, a five-page presentation of facts 
about a problem facing an administrator in his organization. Members are 
given copies of the case separately to read ahead of time and are told that 
although each is given accurate information the experimenter intends to leave 
them uncertain as to whether they each have exactly the same range of facts, 
After they have been read by the members individually, the cases are 
Collected to prevent direct comparison of typed copies; although members 
are allowed to take notes. The task defined for each session is to assemble the 
information, to discuss why the people involved in the REC ы behaving as 
they did, and to decide what should be recommended as action for the solution 
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to the problem presented. The group is asked to time themselves to 40 minutes 
and to dictate the group solution for the sound record in the final one or two 
minutes of the meeting. Then the experimenter leaves the room. The group 
is not arbitrarily stopped at the 40 minute limit, but is allowed to take a little 
longer if necessary. 

Inspection of the summary profile of many such groups [see Table 1] 
shows that in spite of the pains taken in the experimental situation to highlight 
the desirability of communicating factual information, the rates of asking for 
information and giving it are both below the cutting point; however, the 
rate of giving opinion is above the suggested limit, identifying this problem 
situation as one with a notable requirement for the communication of infer- 
ences and value judgments. Rates of showing both agreement and disagree- 
ment are also high. (By “high” or “low” the authors mean above or below the 
suggested arbitrary cutting point one standard deviation from the mean.) 
The high rates of showing agreement and disagreement are plausibly related 
to the presumed status equality of members (no role differences established 
and symmetrical distribution of information) and to the requirement for a 
concrete and specific group decision. Under these conditions, most problem- 
solving attempts by a given member tend to obtain a reaction from some other, 
whether positive or negative. 


2. Slater’s Post-Task Social Session 


The situation is that described earlier, just after the group members have 
individually completed a short questionnaire asking them to evaluate and 
describe each other on several different dimensions and to indicate feelings of 
liking and disliking for each other and the task situation (20). The ex- 
perimenter says that he has one additional short questionnaire, but that he 
would like the members to spend a few minutes talking informally before 
he gives it. He says, “just talk about anything you like, I'll be back in about 
10 minutes.” He then leaves, and stays away from 13 to 15 minutes. 

The summary profile shows that under these conditions the rates of asking 
for suggestions and giving suggestions drop to zero. The showing of antago- 
nism also disappears. The rate of asking for information rises above the cutting 
point as the members “make social conversation” by asking about names, 
home towns, mutual acquaintances, courses, places of residence, and other 
mutual topics of reference. The rates of showing solidarity and tension release 
are also high, as cigarettes are passed around, and smiles, jokes, and laughs 


are exchanged. 
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3. Couch's Prediction and Self-Revelation’s Task 

The groups are essentially like those described earlier, but each group is 
now in a third session of the same set of members, with an expectation of 
three more sessions to follow (7). Each member answers privately a single 
item from a personality test or a values questionnaire. He is then asked to 
predict or guess how each of the other members in the group will answer the 
Same item and to record privately his prediction. Members are then asked 
to reveal their answers to each other, note their errors, and discuss with 
each other in any way they like. Their aim is to improve their predictions on 
the next item. They go through a number of such cycles. 

The rate of showing tension is high under this set of conditions; also, the 
rate of showing solidarity. This is presumably a stressful situation for several 
reasons. Not only is there a requirement to reveal to other group members 
feelings and characteristics that might otherwise be kept private; but success 
in prediction is difficult, failure is immediately revealed, and in the process of 
failing one may inadvertently reveal misconceptions he has of the other that 
may not be complimentary. Presumably the high rate of showing solidarity 
reflects an attempt to keep relationships from deteriorating in the face of these 


threats. 
4. Borgatta’s Three-Man Groups Planning Role Playing 


One hundred twenty-six enlisted men at an Air Force Base are assigned 
to temporary duty to participate in an experiment (5), Tay Are твоей 
from different organizations on the base, to minimize acquaintance. They meet 
in a new observation room with a one-way mirror. The experimenter explains 
that the purpose of the experiment is to observe how small groups work 
together, He says observations will be made of role-playing sessions which 
they themselves plan. Each man participates in four sessions of 48 minutes 
each with two new participants each time. About half of each session is given 
to getting acquainted and planning the role-playing situation the other half 
Consists of the role playing. The planning part of the session is observed, as 
well as the role playing proper. The profile we show is that for the planning 
half of the session. . 

Under these conditions only two rates are different enough to focus on as 
à diagnostic problem. Showing disagreement and showing antagonism are 
below the cutting point. The situation was not designed as it was in order 
to depress these types of interaction; however, it is not unreasonable to sup- 
Pose that enlisted men ordered on temporary duty and placed with others 
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they do not know under observation in a room with a one-way mirror might 
imagine their superior officers to be behind the mirror evaluating their per- 
formance, and might tend to be “оп their best behavior.” It may be noted that 
while the rate of showing tension is not above the cutting point, it is markedly 
higher than the rate of showing disagreement. Our experience has led us to 
believe that this inversion is a presumptive indicator of stress due to inhibition 
of negative reactions. The two preceding profiles both show this pattern also. 


5. Mills’ Three-Man Groups Constructing Projective Stories 


Three students, obtained through the employment office, gather in the 
observation room and are briefly told about the microphones and the mirror 
(14). The experimenter gives each of them a picture, one from the standard 
set originated by Murray for the Thematic Apperception Test, and allows 
a short time for each to look at his own picture separately. The pictures are 
then collected, and the experimenter asks the group to construct a dramatic 
story that in some way weaves the three pictures together in an account of 
what has gone on before, what is going on now, and what the outcome will be. 
He then leaves the group. Other groups of three see all three pictures jointly 
instead of separately, thus changing to some extent the problem of assembling 
information. The profile we show sums scores from both of these two sets of 
conditions. 

The profile under these conditions shows a low rate of giving information. 
In spite of the fact that under one variation each subject must describe his 
picture to the others, the rate is low. One of the intended characteristics of 
the pictures, of course, is that they shall contain a minimum of clear-cut 
factual information. They are vague and ambiguous. It is difficult to make 
statements about them without inference, evaluation, and other “projected” 
elements from the subject’s own experience. On the other hand, the “opinions” 
produced are not clearly differentiated from suggestions because the task 
involves an accretion of such elements only loosely strung together into a 
dramatic form rather than a tightly organized structure of carefully selected 
elements in a highly reasonable arrangement. Almost anything said may be 
taken as a suggestion by the other. Under these conditions the rates of asking 
for suggestion and giving suggestion are high and so is the rate of showing 
agreement. The mood, presumably, is one of phantasy comparatively un- 
checked by fears of offending the other members or violating logical demands. 
There is no definite feedback from the environment that would indicate one 


suggestion is much better than another. 
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6. Olmsted’s Four-Man Groups Playing a Board Game and Making 
up a Story 


Twenty-four groups of four men each are composed of volunteers from 
a university population (15). Each group meets for two 20-minute sessions. 
During the first session the group members play a board game with the 
experimenter as an opponent. During the second session the members rank a 
series of paintings in order of group preference and then make up a story 
based on the paintings. Before the sessions begin the group members are 
given special instructions concerning the best way to go about solving the 
problems, Half of the groups are told that it is best to pay particular attention 
to members’ feelings, while the other half are told it is best to be business-like 
and not to get involved with personalities. 

Since one of the tasks (making up a story about a set of pictures) is similar 
to that used by Mills it is interesting to note that rates of giving and asking 
for suggestion are high and giving information is low. However, Olmsted’s 
groups are high on disagreement (although low on antagonism); whereas 
Mills’ groups were high on agreement. This difference may be connected 
with the need for greater logical rigor in the board game as compared to that 
in the more projective task. The board game (like the projective task), how- 
ever, provides visually complete information to all members; thus reducing the 
need for verbal communication of information. Also, the two tasks are similar 
in requiring a large series or collection of specific suggestions for communica- 


tion to the experimenter. 
7. Slater’s et al.’s Four-Man Groups Under the Influence of Lysergic Acid 


Twenty-four male subjects, all college students, are formed into six four- 
man groups (21). Each group meets twice, once in a control meeting under 
normal conditions and once for a period of one and one-half hours following 
the administration of lysergic acid diethylamide (LSD). Three of the groups 
receive LSD prior to their first meeting ; three, prior to their second meeting. 
For each meeting the subjects are brought into a small room and told to sit 
around a rectangular table. An observer with an interaction recorder is 
seated at another table opposite the subjects. The subjects are then instructed 
to read a case study of a human-relations problem in an administrative situa- 
tion, discuss it, and within 40 minutes reach a group decision as to how the 
problem should be handled (Bales’ standard task). The profile given in Table 
1 represents the average interaction pattern for the six groups under the 


influence of LSD. 
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In these groups the rates of showing of solidarity and tension release are 
extremely high. Both percentage values are more than 1.96 standard deviations 
above the mean. Slater reports that even subjects with depressed or withdrawn 
reactions participate in this apparent hilarity, however involuntary their 
participation may seem. The task-oriented parts of the profile are similar to 
those of other groups on the Standard Case Discussion, except for the rate of 
giving opinion, which is less than half as high as for normal groups on this 
task. The additional fact that asking for opinion is below the cutting point 
seems to indicate a marked drop in concern for logical analysis or consensual 
evaluation in these groups. In general the subjects seem to be using the group 
situation to relieve individual tensions rather than to meet the demands of 
the task. 


8. Lennard’s et al.’s Subjects Under the Influence of Lysergic Acid 


The subjects in this experiment are four college graduates, three female 
and one male, who were considered nonpsychotic on the basis of psychiatric 
interviews and a battery of clinical psychological tests (13). The subjects 
are paid to participate in a discussion of “The place of women in society." 
The discussion is recorded, transcribed, and scored from the typescript. All 
of the subjects participated under the influence of lysergic acid. Two of the 
subjects received 50 micrograms of LSD 25 before the session; the other 
two, 100 micrograms of LSD 25. 

In comparing the interaction profile of this group under the influence of 
LSD with the same group in normal conditions, Lennard ef al. note that 
verbal output is restricted and that there is a marked reduction in negative 
interpersonal responses. Compared with other groups in our reference popula- 
tion this group is not markedly low on negative responses, though it is below 
the mean; however, the rates of giving and asking for opinion are high. The 
authors attribute this rise to the attempts of the members to restore cognitive 
clarity despite their felt impairment due to the LSD. These rates are in 
marked contrast to the rates obtained by Slater in the study just described. 
Lennard’s groups do not have high rates of showing solidarity and tension re- 
lease. It appears that the reaction in Lennard's groups is quite different from 
that in Slater's. It is possible that the topic and the sex pep ce of the 
group constituted a basically different set of conditions с t is que 
topic that could not be ignored. It is also possible that acts of tension release 
were not as evident in the typescript as they might have been in live recording. 


9. Takala's et al.’s Groups Drinking Brandy and Beer 


The subjects are employees of the Finnish government chosen from volun- 
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teers in consultation with the company’s doctor and the supervisor concerned 
(25). They meet in eight groups of four to six members immediately after 
work in a room suitably prepared with tables and tablecloths. Each group 
discusses a series of three topics during a five-hour testing period. A 15-to-20- 
minute sample of interaction is observed for each topic. During one session 
brandy is served; during another, beer. The profile in Table 1 is a com- 
bination of the observations under both the brandy and beer conditions. 


The authors note that their Finnish groups not under the influence of 
alcohol are different from ordinary American laboratory groups. The low 
frequencies of showing solidarity and tension and asking for information and 
opinion may be attributable to cultural differences rather than intoxication. 
The rate of giving opinion, which is high according to our standards, is never- 
theless lower than in the Finnish normal groups; thus the effect of alcohol in 
these groups is to depress the rates of activity in the task categories and 
increase the amount of disagreement and antagonism. 

Perhaps the most salient aspects of this profile are the high rates of showing 
disagreement and antagonism and the low rate of showing tension. These are 

expect from the action of alcohol. The low rate 


effects one would plausibly c 
of showing solidarity may not be characteristic of the action of alcohol under 
h testing whether or not a more usual 


other conditions. It would be wort ; 
later followed by an increase in 


pattern is an early increase in solidarity llow 
antagonism. The study showed that the groups drinking beer tended to be 


more positive than those drinking brandy. 
10. Psathas’ Therapy Groups 


Two therapy groups, each with four patients, were seen by the same 
therapist for one year in psychoanalytically oriented psychotherapy (16). 
Nine meetings of the two groups are selected for analysis: three at the begin- 
ning of therapy, three at the middle, and three at the end. All the patients 
have psychosomatic problems, predominantly asthma. The groups meet in a 
room wired for sound recording. An observer behind a one-way mirror 
records who is speaking to whom. No detailed notes are made of the content 
of what is being said except to note the first and the last words of each per- 
son’s burst of speech. However, nonverbal acts (such as nods, gestures, biting 
fingernails, and looking away) are scored according to the Bales category 
system. The verbal interaction is scored by playing back a tape recording of 
the meeting. To this record is added in the proper places the nonverbal acts 


that have been directly observed. 


In the profiles for these therapy groups both giving suggestions and asking 
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for suggestions are low, an indication that the members are not concerned 
with immediate and specific solutions. In the permissive atmosphere of these 
therapy groups it is not necessary for the group members to reach consensus 
on the solution to common problems. 
As an explanation of the high rate of showing agreement, Psathas (16) sug- 
gests that an 
... important reason for the maintenance of a high level of positive acts 
throughout all phases for therapy groups is probably the continued need 
to encourage patients to present their problems. The therapist is often 
high in simple acts of agreement and probably many of these acts are 
nods of the head or simple phrases indicating that he is listening and is 
interested in having the patient go on. Such acts, which are also initiated 
by patients, function to facilitate the presentation of material for dis- 
cussion. 


11. Winter’s Prison Therapy Groups 


Four therapy groups of 10 to 12 men each are observed during their 
usual one-hour therapy sessions in a state prison (26). The observer, who is 
seated in the back of the room, later checks his scores by listening to a tape 
recording of the session. 

The therapist follows a method of guided group interaction which empha- 
sizes the role of the leader in the therapy process. The leader’s initiative is 
evident in the high rate of asking for opinion since the leader’s interaction 
accounted for about half of the activity in this category for each group. 


12. Roberts’ and Strodtbeck's Groups of Paranoid-Schizophrenic 
and Depressed Patients 


Two groups of five male patients, one a group of paranoid schizophrenics 
and the other a group of depressed patients, are observed as they reach a 
group decision under the leadership of a psychiatrist (18). During the discus- 
sion each of the observers notes the target of one subject’s action and scores all 
the nonverbal acts that could not be recovered later from a tape recording. 
Later, when the tape recording of each session is transcribed, the observers’ 
notes are added to the final protocol. The interaction-process scoring 15 then 


done from the transcribed protocols by persons who observed the sessions and 


d in the Bales’ technique. | | 

In the combined group profile shown, two categories of acts differ from the 
average. Showing antagonism is high, primarily a reflection of the generally 
“hostile” characteristics of the paranoid-schizophrenic patients. The high 
rate of showing tension reflects the high-tension level of both types of patients. 


were traine 


4 
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13. Evans’ Therapy Groups 


Two therapy groups of 10 persons each are composed of male, adult patients 
at a psychiatric clinic for veterans (8). Each group meets for 16 sessions. Two 
observers score each session through a one-way mirror using an earlier version 
of the Bales’ category system which contained 49 categories. Acts on which 
the observers show marked differences in scoring are omitted, and the re- 
maining categories are combined into 12. 

Both patients and therapists are high on giving information and low on 
Most of the interaction of the patients in this type 
ving information about the self. The high rates 
ly the result of the activity of 


asking for information. 
of therapy appeared to be gi 
of asking for opinion and suggestion are primari 
the therapists as they sought to draw out the patients. 


14. Hare’s et al.’s Psychiatric-Stress Interviews 


A series of 24 interviews with adults are observed through a one-way mirror 
in an initial psychiatric interview (11). The interviewer follows a partially 
standardized pattern, based on Chapple’s methods. The same experienced 
psychiatrist interviews all 24 patients. The majority of the subjects are out- 
patients in a large New England hospital. They are predominantly lower 
class with an average age of 36. Seventeen are female; seven, male. 

The interviewer has certain rules to follow during each of the five sub- 
periods of the interview. The rules govern the timing of his responses, but 
not the content of what he says. Periods 1, 3, and 5 are “free” in that no 
behavior stress is applied. For each patient utterance the doctor must respond 
within one second after the patient has finished speaking. In addition, the 
interviewer must speak for approximately five seconds during each of these 
utterances, In Period 2, a silence stress is applied to the subject as the inter- 
viewer now “fails to respond” by waiting up to 15 seconds after the subject 
has finished speaking. If the subject still has not spoken after the 15 seconds, 
the interviewer speaks again for five seconds. This period ends after 12 
“failures to respond” or 15 minutes, whichever is shorter. Period 4 is one of an 
interrupting stress. Each time the subject speaks the interviewer waits for 
about three seconds and then interrupts with a five-second utterance for a 
total of 12 times. If the subject continues to speak through this interruption, 

til the subject has finished his 


the interviewer does not interrupt again Un", s 
utterance, The time limit for Period 1 is 10 minutes; for Periods 3 and 5, 
к Period 2 is eight minutes long and 


five minutes. In the average interview, t 1 
Period 4 is four minutes, bringing the average total interview to about 


32 minutes. 
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The interaction pattern that results from this combination of predetermined 
formal interaction pattern and a process devoted to fact gathering rather than 
group problem solving differs markedly from the average pattern of the 21 
types of groups. The doctor plays the major part in asking for information to 
which the patient responds by giving information. In addition to the gener- 
ally high level of tension shown by the patient throughout the interview, the 
doctor also contributes to the tension score as he fails to respond during the 
silence stress. No suggestions are asked for by either party, and only a few 
direct suggestions are made by the doctor. Low rates of joking and laughing 
and showing solidarity indicate that the tension throughout the interview 
is unrelieved. Low rates of agreement and disagreement reflect the fact that 
group consensus is not required by the task. The patient’s acts contribute most 
heavily to the total interaction profile since he contributes approximately four 
acts for every act of the doctor. 


15. Strodtheck’s and Mann’s Jury Deliberations 


The participants in these mock-jury deliberations are drawn from the 
regular jury pools of the Chicago and St. Louis courts (24). The jurors listen 
to a recorded trial. They then deliberate and return their verdict—under the 
customary discipline of bailiffs of the court. The deliberations are recorded 
with two microphones to facilitate binaural identification of individual par- 
ticipants. The recordings are fully transcribed, and these protocols are in 
turn scored in terms of interaction-process categories. The scoring is done by 
an assistant who listens again to the recording and has available the indi- 
cations of nonverbal gestures made by the original observer. "The level of 
interscorer reliability is checked before the Scoring begins and rechecked 
periodically while scoring is in process. 

The 12 protocols utilized in computing the summary profile shown are the 
final 12 of a set of 30 in which jurors considered an auto negligence case. 
Seventeen of 144 jurors originated less than five acts each and have been 
dropped from the tabulations. 

In the task area this summary profile does not differ from the average of 
the 21 profiles. However, the jurors show more agreement and less tension 
and antagonism than the average group. A higher-than-average amount of 
agreement is also found in other types of discussion groups in which the 
group members are required to reach consensus on an issue [see Table 1]. 


16. Strodtbeck's Husbands and Wives 


Each of 10 couples from three cultures, Navaho, Texan, and Mormon, are 
asked to discuss the differences in their ratings of some mutual friends (23). 
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Each couple is asked to pick three reference families with whom they are well 
acquainted. The husband and wife are then separated and requested to desig- 
nate which of the three families appears most satisfactory with respect to 
each of a series of 26 conditions, such as “Which family has the happiest 
children?" “Which family is the most religious?" “Which family is the 
most ambitious?” After both husband and wife have individually marked 
their choices, they are requested to reconcile their differences and indicate 
a final “best” choice from the standpoint of their family. The discussions are 
recorded in the field by portable sound equipment powered from a truck. The 
recordings are then transcribed and, in the case of the Navaho, translated into 


English before being scored. 

Although Strodtbeck found differences in interaction profiles between cul- 
tures and between the most-talking and least-talking spouse in each culture, 
the only category in which the summary profile for all 30 couples is dis- 
tinguished from the others in Table 1 is a high rate of agreement. This 
seems to reflect the fact that the task called for reaching agreement on a 


number of judgments. 
17. Quade’s College Couples 


Sixty college couples are observed from behind à one-way mirror, while 
discussing two problems for an average time of 31 minutes per problem (17). 
Since couples vary considerably in the time it takes to solve the problem 
(range: 10 to 65 minutes) the scores for the first 15 minutes of each session 
are used. In those cases in which the couple talks for less than 15 minutes 
an adjustment is made to yield comparable data. р 

For the discussion of the recreation problem, given first, a large calendar 
is placed on a work table. The couple is instructed to go through the calendar 
day by day and construct a schedule of all their leisure and recreational 
activities. Each makes a separate schedule. One spouse is not allowed to 
enter an activity on his schedule without the consent or agreement of the 


other. 


For the budget problem, the couple is told : 
ditures for a typical month. 
truct a budget of all your expen f 
ме mi i are to use in building your budget is your own present 
hs 1 tebe haa pay plus 25 per cent. You cannot spend more than one 
оша » the 25 per cent on any one thing. Keep separate records of your 
e ©, m enter anything without the consent or agreement of your 
хо Wien, you have finished the budgets will be identical. 
After presenting each problem the observer leaves the room where the 
couple is working and records their interaction through a one-way mirror. 
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Operating under these task instructions, which require immediate and 
specific consensus at many points throughout the total problem, the rates of 
giving and asking for suggestions are higher than average. The low rate of 
showing solidarity probably reflects several factors. First, as Quade notes 
in his analysis of the data, married couples devote a very small percentage 
of all their actions to building solidarity, presumably because they are more 
secure about the level of solidarity they have obtained. Second, half of the 
couples in Quade’s sample were classified as maladjusted; and these couples 
tended to have even lower rates of showing solidarity. Finally, according to 
Quade, the recreational problem involved less potential conflict than the budget 
problem and therefore required fewer attempts to restore equilibrium by 
showing solidarity. 


18. Landsberger’s Labor-Management Mediation Sessions 


Twelve mediation cases are analyzed from transcribed recordings (12). 
Each of the groups consists of employees and officials of labor unions; of 
employers and their lawyers or other adjunct personnel; and of a labor medi- 
ator. It is the mediator’s function to assist the two parties in reaching an 
agreement after bipartite negotiations between them have previously broken 
down. The cases are typical of those involving enterprises with less than 25 
employees. Each case was settled or adjourned sine die at the end of one meet- 
ing—the one recorded. Each mediator has an office in which the mediation takes 
place. He differs from an arbitrator who typically hears a case, much as does a 
judge, and thereafter makes a formal award. The mediator does not make 
an award; he clarifies, persuades, and suggests. 

The emotional involvement of the parties in the discussion is reflected in 
the higher-than-average amount of showing disagreement and antagonism. 
The amount of hostility within a session is negatively correlated with the 
success of the outcome of the session. The serious intent of the sessions is 
suggested by the low rate of showing tension release. 


19. Hare's Boy Leaders 


Twelve third-grade boys who have been identified by their teachers as 
leaders on the school playground are observed at play with their friends in 
their home neighborhoods after school (10). Six of the boys have been selected 
‘because they are group oriented; the other six, because they are self oriented. 
‘Two observers score interaction in the field. Each observer scores only one 
boy at a time, recording all of his acts directed to others and all acts directed 
to him by other group members. When possible, each observer makes half of 


the observations on each subject. 
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The average observation period in the neighborhood is four sessions for 
a total of 75 minutes (218 acts). Interaction with parents or observers is 
not recorded; only the interaction with other children is included in the 
Profiles. Nonverbal interaction called for by the rules of the game is also 
omitted. 

This average profile for boys’ leaders and their peers is similar in several 
respects to that of Gruber’s children playing with dolls. Both show a marked 
contrast with the Harvard-laboratory discussion groups. For the children, 
agreement and giving opinion are low; asking for information is high. The 
tendencies are just the opposite of those in the problem-solving groups. 
Rather than to deliberate and attempt to reach consensus on issues, these boy 
leaders and their followers interact very directively, with high rates of 


giving suggestions and showing antagonism. 


20. Gruber's Children Playing with Dolls 


The behavior of 13 preschool children judged to be high on Task orienta- 
tion and 13 judged to be low on Task orientation is recorded as they play 
with a doll house and dolls in the presence of a psychologist (9). When each 
child is taken into the.playroom he sees a doll house without dolls. He is 
then given dolls as he asks for them. As the child plays with the dolls for 
40 minutes the psychologist takes longhand notes of the child's behavior and 
interaction. 'The protocols are scored later, using the Bales' categories. 

Since the psychologist is primarily interested in how much task behavior 
the child will initiate if left to his own devices there is relatively little inter- 
action with the experimenter. The intention of the experimenter is reflected 
in his low frequencies of agreement, giving opinion, and asking for opinion 
and suggestion. The high rate of tension represents the child’s attempts to 
withdraw from play with the doll house. The high rate of showing antago- 
nism, which was also evident for the boy leaders, would seem to indicate that 
children are freer in expressing hostility than adults except when the adults 


are intoxicated or under some emotional stress. 


21. Borgatta’s et als Students of Interpersonal Relations 


A class of 47 graduate students in a course on interpersonal relations 
is divided into five subgroups that meet for a two-hour discussion period 
each week (6). The class is made up of relatively mature Persons (average 
age 34 years) representing a number of socially oriented professions. There 
is a slightly larger number of females than males. At the time the study takes 


Place the groups have met for at least nine weeks. 
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The groups have little or no external supervision. For each session, per- 
sons are selected on an informal rotational basis to serve in the positions of 
leader, observer, and recorder. The latter two seem to be similar in form 
because they both require the incumbent to record what is going on, although 
from different points of view. There is an emphasis throughout the dis- 
cussions on the analysis of the group’s own process and on aspects of behavior 
that tend to reflect "democratic leadership.” 

Each of the five groups is observed twice, for approximately two hours each 
session. Although the investigator is in the room, he is excluded by agreement 
from the attention of the group. Remarks made during the sessions and in 
discussion after the second session indicate that individual members occa- 
sionally were conscious of the observer's presence; however, in most cases 
the groups operated in what they described as their normal manner. 

These discussion groups and the husbands and wives observed by Strodt- 
beck are closer to the mean profile for all groups than any other groups in 
the sample. Both differ from the average by showing a rate of agreement 
that is one standard deviation above the mean. In this case the high amount 
of agreement seems to be a reflection of the emphasis on “democratic process.” 


E. SUMMARY . 


A number of investigators have used the same set of categories for inter- 
action-process analysis. Twenty-one profiles from various studies are presented 
as a reference population. Each profile is either a summary for a given study 
or for a series of studies under the same conditions. An interaction profile for 
a group or some set of individuals can now be compared for diagnostic purposes 
with this reference population. 

A. profile from a given study is made comparable with others by trans- 
forming the number of scores in each category into a rate that is computed as 
the percentage of the total number of acts in all 12 categories. Means and 
standard deviations for the rates in each of the 12 categories were computed 
for the 21 reference profiles. Cutting points of one standard deviation above 
and below the mean appear to be useful for diagnostic inspection and clinical 


evaluation of single profiles. 
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CORRELATES OF CIVIL-RIGHTS PARTICIPATION®*?-2 


University of Rhode Island and Fisk University 


LESTER Carr AND S. OLIVER ROBERTS 


A. INTRODUCTION 


Examination of the literature on social attitudes raises questions as to the 
degree of consistency between verbal attitude expression and overt action 
(4, 8, 14). In a comprehensive study, Pace (11) analyzed the replies of some 
2500 university alumni to a 16-page questionnaire that included seven “Ac- 
tivity” scales and nine Likert-type “Opinion” scales. On the basis of the small 
correlations found between the attitude and activity measures, Pace declared, 
“Opinions are the verbalized expression of attitudes, opinions are not action.” 

Campbell (3) takes the position that much of the so-called inconsistency 
between attitudes and action is “pseudoinconsistency” and would disappear if 
more reliable and valid overt-behavior measures were used and a wider variety 
of behaviors was sampled. In line with this, the Rosenzweig Picture-Frustra- 
tion test, a Likert-type scale, a Guttman scale, and the H Technique were 
used to predict the willingness of college students to sign a petition in favor of 
Fair Employment Practices Legislation (6). These four attitude-measuring 


devices were able to predict the criterion almost equally well, with biserial 


correlations of .54, .61, .58, and .59 respectively. However, the criterion 
measure employed is questionable because the signing of a petition may be 


more representative of verbal behavior rather than of complex social-action 


involvement. 
Searles and Williams (13) administered questionnaires to 827 students at 


three Southern Negro colleges and investigated the relationship of demogra- 
phic and attitudinal variables to civil-rights activities. When their data were 
divided into three groups according to the “degree” of respondent's partici- 
Pation, they found “fewer than [a] chance number of statistically significant 


differences among the three.” 
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Recent activities of Negro American college students in “sit ins" and 
similar social-action participation provide unique opportunities to check 
expressed attitudes against overt behavior: i.e., to obtain measurable behavioral 
correlates of verbalized attitudes. In this study, three conventional attitude 
scales designed to measure attitudes toward social action were related to the 
level of involvement in and frequency of social-action activities; also, cor- 
relations among the various attitude and behavioral measures were compared. 
In addition to the primary methodological purpose, another aim was to explore 
some self-concept and background variables and their relationship to social- 
action behavior. 


B. METHOD 
1. Subjects 


The total pool for the study consisted of 501 Negro American college 
students, freshmen and sophomores, in a predominantly Negro, private liberal- 
arts university located in the South. A group of 332 Ss (104 males and 224 
females) was randomly drawn from the total pool and given the final form of 
the attitude scales. 


2. Behavioral Measures 


On the basis of student statements, social-action behavior was divided 
into the three levels as follows: 

a. Level 1 (actively involved). Participation in a minimum of one “sit in” 
or "stand in." 

b. Level 2 (moderately involved). Participation in some civil-rights activity, 
such as steering committees, making posters, etc.; but “по” 
any "sit ins" or "stand ins." 

c. Level 3 (uninvolved). No participation of any kind in civil-rights 
activities. 

Assignment of $ to one of the levels depended on his satisfying both of the 
following criteria: (a) being identified by a student leader as having partici- 
pated in a particular activity and (5) indicating on a questionnaire, civil- 
rights involvement. In addition, the total number of times $ had been involved 
in civil-rights activities of any type was determined. 


participation in 


3. Attitude Scales 


Three scales were employed: social distance, social action, and semantic 
differential. The Bogardus social-distance scale concerns "direct actions" 
rather than more general feelings about people or situations and seems especial- 
ly appropriate for measuring attitudes toward “social action.” The traditional 
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Bogardus groupings were applied separately to categories, such as Mexican, 
Puerto-Rican, lower-class Negro, Italian, Irish, etc. (2). An overall index o£ 
social distance was obtained by computing the sum of the groupings checked 
for a particular social category and adding all 16 social-category scores; 
therefore, a high score signifies less social distance: i.e., more social contacts 
desired with the various categories. 

Nunnally (9), utilizing a semantic-differential type questionnaire, found 
that "understandability" was an important component of public reaction to 
the mentally ill, and “nervousness or anxiety" is the cardinal sign of mental 
disorder in public thinking. Using Nunnally's findings as a reference point, 
items concerning evaluation, understandability, and adjustment were designed 
for the present research (21 scales in all, randomly polarized) : 

Evaluation was represented by the following eight scales: kind—cruel, 
effective—ineffective, pleasant—unpleasant, intelligent—ignorant, good— bad, 
fair—unfair, wise—foolish, valuable—worthless. 

Understandability was represented by the following six scales: predictable— 
unpredictable, familiar—strange, understandable—mysterious, simple—com- 
plicated, confusing—clear, straight—twisted. 

Adjustment was represented by the following seven scales: relaxed—tense, 
stable—unstable, mature—immature, happy—sad, calm—anxious, unemotional 
—emotional, well-adjusted—maladjusted. | | 

Nine groups considered as "concepts" were to be rated on a nine-point 
centinuum on the aforementioned scales. "These groups included Northern 
Negroes, Southern Negroes, Northern Whites, Southern Whites, Negro Col- 
lege Students Who Take Part In Civil-Rights Demonstrations, Negro College 
Students Who Do Not Take Part In Civil-Rights Demonstrations, White 
College Students Who Help Negroes In Civil-Rights Demonstrations, White 
College Students Who Do Not Take Part In Civil-Rights Demonstrations, 
and Myself. A score for each of the groups of scales (evaluation, understand- 
ability, and adjustment) was obtained by adding the separate ratings on the 
various scales composing a particular group. The higher the rating the more 
Positive it was and indicated a higher evaluation or greater understanding or 
adjustment attributed to a particular concept. : . А 

Student remarks served аз а basis for composing social-action statements. 
These statements were included in a Likert-type instrument that went through 
several stages of purification: an item-analysis was performed as described in 
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4. Background Factors 


A questionnaire was used for obtaining all background information and 
included the following variables: socioeconomic status (Minnesota Occupa- 
tional Scale), region, family disorganization (married, separated, divorced), 
and degree of integration of elementary and high schools attended. 


5. Identical Concepts On Different Instruments 


In order to determine the extent to which the type of instrument employed 
affects expressed attitudes, randomly selected sections of freshmen and soph- 
omores (80 Ss from the original pool) were given Bogardus-type and semantic- 
differential instruments—both of which included the same two terms (North- 
ern Whites and Southern Whites). 


C. RESULTS 
l. Instrument Reliabilities 


The following four-month test-retest reliability coefficients were found: 
social-distance scale, .93; social-action scale, .80; and for all nine concepts 
composing the semantic-differential scale, the median correlation for each of 
the groups of items was .45 for evaluation, .43 for understandability, and 
43 for adjustment [see (10), Chap. 3]. The highest semantic-differential 
reliabilities were found for the “Myself” concept (evaluation .57, under- 
standability .53, and adjustment .62). 


2. Attitudes Versus Behavior 


Examination of relationships among all attitude variables and the be- 
havioral measures reveal no correlation coefficient that accounts for more than 
nine per cent of the variance in social-action behavior. In fact, 80 per cent 
of the Pearson product-moment coefficients are below .15; therefore, if there 
is a relationship between verbalized attitudes and actions (as defined in this 


study) it must be small indeed. 


3. Attitude Scales and Relationships 


When the three attitude instruments (each including several measures of 
attitudes toward social action) were related to each other no coefficient 
greater than .29 could be found (7). [See also Table 1.] As demonstrated in 


LE "Low —— | 


TABLE 1 
N INVOLVEMENT AND 
HO TAKE PART IN Сї 


CORRELATIONS BETWEEN MEASURES ОЕ $остАт.-Аст1о: ATTITUDES TOWARD SOCIAL ACTION, SOCIAL 
DISTANCE, NECRO COLLEGE STUDENTS W. 


VIL-R4GHTS DEMONSTRATIONS, AND SELF* 


" Variable 
Variable Sex 1 2 3 4 5 6 ri 8 9 10 
1. Social-action level M 
F 
2. Social-action frequency M 64 
F 39 
3. Social-action Scale: total score M 26 .29 
F 25 10 
^. Social-Distance Scale: total score M 44 44 29 
F .08 :03 15 
Negro college students who 
take part in civil-rights 
demonstrations: 
5. Evaluation M 25 25 26 —.10 
F 45 10 08 —4A3 
6. Understandability M 17 45 AT 13 .68 
F 15 13 14 —.08 49 
7. Adjustment M 07 05 20 07 .60 .69 
Е 04 —.01 46 —.04 43 47 
Myself 
8. Evaluation M —.09 —.02 —15 —.06 :32 28 34 
F .03 10 02 —.09 42 25 24 
9. Understandability M —.10 —.08 01 -03 27 .38 27 .58 
Е 10 20 .01 —42 .20 27 17 46 
10. Adjustment M — 12 —.18 —.08 —.01 27 25 25 67 68 
F 05 06 -00 —.08 27 27 49 48 44 
Instrument reliability 80 .93 41 43 41 sr .53 .62 
* Ап r of .20 (males) and .14 (females) is re 


quired for significance at the .05 level. N for males — 104; N for females — 
228. Higher social-action level and frequency are considered аз positively related to the more favorable end of the other 
scales: for example, less social distance is favorable, and rating on a semantic-differential scale is scored towards the positive 
(favorable) adjective. , 


$1ЧЯЧОЧ ҸЯЛІТО 'S ANV WAVO YALSAT 
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Table 2, small and nonsignificant correlations (.10 is the largest) were found 
between scores obtained from concepts and associated groups of items that are 
identical except for being embedded in different-type instruments. 

ABLE 2 


CorRELATIONS BETWEEN IDENTICAL CONCEPTS APPEARING IN 
DIFFERENT INSTRUMENTS 


(N = 80) 
Semantic-differential scale 
Evaluation Understandability Adjustment 
Southern Northern Southern Northern Southern Northern 
Bogardus scale whites whites whites whites whites whites 
Southern whites -10 02 .02 .05 .05 .03 
Northern whites 05 07 10 .04 01 .02 


4. Background Factors and Social Action 


Table 3 shows small correlations between the behavioral measures and a 
variety of background factors. 
TABLE 3 


CORRELATIONS BETWEEN MEASURES OF SOCIAL-ACTION INVOLVEMENT 
AND SELECTED BACKGROUND FACTORS* 


Integration 
North- Elementary High Parents’ Occupation 
Social action Sex south school school relationship Father Mother 
Level-action level м —22 48 21 —.12 .03 —.07 
F —04 ^: —.08 —.01 .02 .09 —.01 
Frequency M  —26 .23 18 —12 09 —.07 
F —.09 .02 .07 .01 16 —:04 


* An r of .20 (males) and .14 (females) is required for significance at the .05 level. 
N for males = 104; N for females = 228. In the case of the background variables it 
is assumed that North, higher degree of integration (elementary or high school), 
parents together, and “higher” occupation of either father or mother are positively 
related to greater amount of social-action involvement. 


D. Discussion 


The present study reveals no factors (attitudes or otherwise) significantly 
related to social-action behavior. These findings are in agreement with those 
reported by Roberts and Carr (12) in 1961. The small correlations are not 
due to insufficient reliability. Except for the semantic-differential scale, reli- 
abilities are generally high for all measures; and even with those for the 
semantic-differential scale, the reliabilities are sufficient to support relatively 
large correlations. Certainly the number of cases was large enough to leave 
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little doubt of the inconsequentiality of the correlations. One possibility for 
finding low correlations is attenuation of unreliability; however, even after 
attenuation corrections were made relatively low (and insignificant) correla- 
tions are found. The attitudes are of opinions highly “crystallized”: for ex- 
ample, the opportunity to involve one’s self in civil-rights activities was 
clearly present over a period of two years. The university campus is located 
within walking distance of the demonstration areas, and participation was 
fully supported by both the university administration and faculty. 

One possibility to account for the slight correlations is that individual 
characteristics play a relatively minor role; other, more primary determinants 
of social action being a wide range of social-situational or field forces. Evi- 
dence to support this position was provided by intensive open-end taped inter- 
views with approximately 10 per cent of the Ss at each of the three social- 
action levels. For example, some reasons given by students at Level 3 
(uninvolved) for not taking part in civil-rights activities included parental 
pressure, fear of names being printed in hometown newspapers, numerous 
complaints about the policies and actions of local and national civil-rights 
organizations, and poor grades; while some reasons given by those at Levels 
1 (actively involved) and 2 (moderately involved) for participating included 
a social-activist’s talk, newspaper editorials and general reports, television 
accounts of violence to Negro students, individual or group persuasion. by 
fellow students, educational literature, and appeals of local and national civil- 
rights organizations. я . . 

Perhaps the small and nonsignificant correlations were pig in some 
unknown way by factors peculiar to the investigation. If E t E ge 
point to a surprising inconsistency among three pred use ods of 
studying verbalized attitudes and suggest that доре of t cus Wee 2 Ve 
widely used background and self-concept variables) is pre ictive of real-life 
behavior. In essence, it may well be that (at least in the case of Negro 
American college students) the crucial determinants of social-action involve- 
ment are immediate, contextual field conditions ; and with the impact of these 
social-situational field forces, any consistent individual differences are soon 
overcome. Since for the majority of Negro students their ey d 18 Hot 
static and can be viewed as an open system 1n the aime ^ есопипе, 
this has implications for the present desegregation struge A ). T "erm 
view material emphasized, educational agencies, fia mecs votes AM 
groups can play a significant if not primary role in pede a es 
and stimulating or inhibiting student involvement in socia ; 
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E. SUMMARY 


The level of involvement in and the frequency of social-action activities 
were examined in relation to several conventional scales designed to measure 
attitudes toward social action, self-concept, and background factors. The Ss 
were 501 Negro American freshmen and sophomores in a predominantly Negro, 
private liberal-arts university located in the South. A group of 332 Ss (104 
males and 224 females) was randomly drawn from the total pool and given 
the various scales. The remainder of Ss were utilized for instrument purifi- 
cation and pretesting. Product-moment correlations were calculated among 
the different attitude scales and behavioral measures. 

Substantial test-retest reliability coefficients were found for the attitude 
instruments and high agreement was indicated for the behavioral measures. 
However, practically no significant correlations were found between any of 
the measures of expressed attitudes and overt behavior. Furthermore, small 
correlations were found between the several attitude scales supposedly measur- 
ing the same thing. Moreover, small correlations were found between re- 
sponses given to identical concepts but imbedded in different-type instruments. 
Added to all this, examination of all correlations revealed no factor that 
accounted for more than nine per cent of the variance in social-action be- 
havior. Reasons to account for these findings were discussed. 
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given a series of Thematic Apperception pictures, 


in the manner proscribed by McClelland (2). 
for need achievement. 


and protocols were obtained 
These protocols were scored 


The paired-adjective continua employed were 12 in number and were 
arranged so that each pair was separated by a seven-point scale, Each subject 
was invited to rate his conception of “time” on each of the scales. It appeared 
on inspection that subjects varied considerably both in the means tendency on 
the scales and, more important, on their range of ratings. Accordingly, the 
raw ratings were not subjected to analysis, but were converted into normal- 


ized T' scores for each subject; thus equating for each subject the mean and 
the standard deviation of his ratings on the 12 scales. 


C. RESULTS 


TABLE 1 
CORRELATIONS BETWEEN NEED ACHIEVEMENT AND ADJECTIVE CONTINUA 
Continuum n Continuum r 
clear—vague 24 tense—relaxed .00 
sharp—dull .18 cold—hot —.06 
empty—full 08 sad—happy —її 
active—passive 06 deep—shallow —AI 
young—old 06 soothing—irritating —11 
fast—slow 02 small—large —.20 


Individually, none of the correlations re 
fidence (i.e., .28), and only one exceeds th 
thus, it is clear that our results are not c 
those obtained with our Metaphor Test. 
note that the ordering of our adjective s 
consider those adjectives that correlate 
zero, we find that need achievement correlates Positively with “clear,” “sharp,” 
NN , Shallow,” kr uu d and "large." It correlates negatively sh 

vague," “dull,” “sad,” “deep, soothing,” and “small.” 


e MES "There appears a con- 
firmation of that pattern manifest in the correlations o£ time metaphors with 


ported attains the .01 level of con- 
€ .05 level of confidence (i.e, .21) ; 
omparable in their significance 
АП the same, it may be instructi 
cales does not appear random. If we 
at least one standard deviation from 


with 
ve to 
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need achievement: namely, the tendency for energetic and dynamic images of 
time to be associated with high need achievement. 

Next, we proceeded to a factor analysis of our matrix, comprising correla- 
tions between our 12 adjectival continua and our measure of need achieve- 
ment. Two factors were obtained, though for our purposes we shall present 
only the first and clearly most incisive factor. The loadings on this first factor 
are shown in Table 2. 


TABLE 2 
First Facror Loapincs EMERGING FroM FACTOR ANALYSIS oF 12 ADJECTIVE 
CONTINUA AND NEED-ACHIEVEMENT MEASURE 


Adjective pair Loading Adjective pair Loading 
active—passive —53 small—large ло 
sharp—dull —49 deep—shallow 39 
fast—slow —.46 soothing—irritating 45 
tense—relaxed —.37 empty—full AS 
clear—vague —37 sad—happy 63 
Need achievement —.33 cold—hot ES 
young—old —24 


It will be appreciated that the loadings of each of these adjective scales 
may be reversed in sign by reversing the paired adjectives. If this is borne 
in mind, we find that need achievement is factorially proximate to the fol. 
lowing: active, sharp, fast, tense, clear, hot, happy, full, irritating, and 
shallow. It is factorially alien, on the other hand, from the following adjec- 
tives: Passive, dull, slow, relaxed, vague, deep, soothing, empty, sad, and cold, 


D. Discussion 


The results that we report here are congruent with the implications of our 
earlier study relating preference for time metaphors to measured need achieve- 
ment. In a sense, the present study may be thought of as a replication with 
à variation in one of the dimensions: namely, the mode of measured attitudes 
toward time. It is of some interest to us that we obtained more incisive results 
from our metaphor scale for time than from the semantic differential. Both, 
however, have indicated that high need achievement is associated with an 
energized and dynamic sense of the flow of time; while low need achievement 
is associated with passive and static attributes, Elsewhere we have reviewed 
the theoretical implications of this relationship and shall not pursue it further 
here, But we believe that the dynamic time sense is intimately involved with 
the rise of Western civilization and the emergence of an industrial and 
Capitalistic social order. 
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A STUDY OF PREJUDICE IN NEGRO AND WHITE 
COLLEGE STUDENTS*1 


Department of Psychology, Emory University? 


Luis PRoENZA AND Bonnie R. STRICKLAND 


A. INTRODUCTION 


Despite radical social upheaval in regard to Negro civil rights demands, 
there has been little contemporary research on prejudice and attitude change 
accompanying increased Negro-White activity. The paucity of research is 
such that it is still important to describe and identify prejudicial attitudes. 
The present study is an attempt to document intensity of prejudice among 
Southern and Northern college students, as well as to investigate significant 
methods of assessing prejudice. 

In the main, previous research has supported the belief that Southern whites 
are considerably more prejudiced toward the Negro than are Northern whites 
(7, 9, 10, 12, 13). Moreover, through continuous research since 1925, 
Bogardus has shown that whites of all regions have rated Negroes consistently 
low on the social-distance scale; i.e., white persons have felt a considerable 
amount of social distance between themselves and the Negro. However, some 
Change is taking place, as is evidenced by data (obtained by Bogardus in 
1956) that showed whites to rate Negroes as considerably less socially distant 
than did ratings 10 years earlier. 

The ratings made by Negroes generally show much less social distance 
and prejudice toward whites than do ratings by whites on their attitudes 
toward Negroes (3, 14). However, in data reported in 1953 by Gray and 
Thompson (6) and in 1963 by Fagan and O'Neil (5), the ratings of Negro 
Students toward a combination of different ethnic groups show more social 
distance than do ratings of the same groups by whites. In other words, Negroes 
show more social distance in regard to different ethnic groups than do whites; 
but they generally show less prejudice toward whites than whites do toward 

egroes, 

In regard to sex differences, Pettigrew (9) reported that women in the 
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South are significantly more prejudiced than are males in the South; while no 
significant difference was found betw 


also reported that women show more 
do males; however, in an earlier study, 
women were significantly more favorabl. 

Rating scales of prejudice in the Past, 
questionnaires or interviews that require 


al and Thurstone scales designed to 
measure the concepts “The Negro,” “The Church,” and "capital punish- 
shown to be high and equivalent; and, 
ation between the semantic-differential scale 

und. A study by Tannenbaum—reported by 
Osgood et al, (8)—also showed significant correlations between Bogardus 
social-distance ratings and semantic-differential ratings of Germans, Chinese, 
and Hindus. 


and the Thurstone scale was fo 


B. METHODOLOGY 
Subjects were members of 


introductory-psychology classes, 
agreed to have their classes part 


icipate in psychological experimen 
students had no knowledge of the study until they were asked 
complete the rating scales. All subjects cooperated, 

The population tested consisted of 35 white ma 
a large university located in Ohio; 


a relatively small private university in Atlanta 


Instructors 
tation, but 
in class to 
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naires, but were required to give age and sex, Final groups consisted of 79 
Negro students, 119 white students in Southern schools, and 59 white students 
in a Northern school. 

A booklet was constructed of a modified Bogardus social-distance scale and 
semantic-differential scales for the concepts Negro, white, integration, and 
Segregation. Semantic-differential scales were the same for all subjects. For 
white students, the one form of the Bogardus scale used consisted of seven 
Statements ranging from “I would have Negroes live in my country” to “I 
would marry a Negro." The Negro students rated whites on the same state- 
ments. Instructions for the Bogardus scale were as follows: 

For each of the seven statements below, indicate either True or False 
by checking the appropriate blank. Your answer should be in relation to 
the white [Negro] population in general, and not in relation to any one 
isolated individual you have known. Try to answer as quickly as Possible, 
since first impressions aid in the degree of validity of the question, 


Instructions for the semantic differential were: 


In each of the next four pages, you will find a set of scales which are 
Constructed to measure a particular concept at the top of each page. Rate 
each concept against each of the adjectives according to what the concept 


means to you. 

If you feel the concept you are rating is very personally related to 
either of the adjectives, then place an X in the appropriate direction. 

If the adjectives should be equally related or completely irrelevant, then 


Place an X in the central or neutral space. 
Relatedness should be considered as follows.—very: quite: slightly: 


neutral: slightly: quite: very:— | 
Please work with speed and check every item. 
Be sure you understand the instructions above, 


The semantic differential was composed of 20 pairs of bipolar adjectives 
high on the evaluative factor according to Osgood (8). Such adjectives as 
800d—bad and perfect—imperfect were used. 


C. RESULTS 


No Significant differences in means and standard deviations were found 
between ratings of students of the two white Southern schools. Consequently, 
all white Southern student’s ratings were combined for analysis, 

Means and standard deviations of the Bogardus scale and semantic-dif- 
ferentia] ratings by Negro students, white students in the Northern schools, 
and white students in Southern schools (males and females) are shown in 


Table ЛЬ 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS OF THE BOGARDUS SCALE 
AND THE SEMANTIC-DIFFERENTIAL RATINGS 


Subjects, sex, Semantic-differential scale 


and Bogardus Segre- Inte- 
parameter N scale White Negro gation gration 
Negro 
Male 61 
Mean 6.46 3.50 3.37 5.27 2.52 
SD 67 -67 67 1.05 65 
Female 18 
Mean 6.28 2.95 3.02 5.33 2.56 
SD 56 48 .69 71 -66 
Southern white 
Male 59 
Mean 4.75 3.42 4.18 4.33 3.79 
SD 1.83 52 52 96 88 
Female 60 
Mean 4.83 3.45 3.88 4.55 3.45 
SD 1.08 42 39 90 78 
Northern white 
Male 35 
Mean 5.11 3.25 3.95 4.62 3.54 
SD 1,33 50 -66 -67 76 
Female 24 
Mean 5.83 3.38 3.74 4.53 3.54 
SD 92 .60 .65 1.03 .82 


A two-by-three analysis of variance of the Borgardus ratings gives a highly 
significant F between schools [see Table 2]. Computation of ¢ tests shows 
that the Negroes exhibit significantly less social distance toward whites than 
Southern whites exhibit toward Negroes. No significant differences were 
found between Northern and Southern male ratings on the Bogardus social- 
distance scale; however, white females in the Northern school show signifi- 
cantly less social distance toward Negroes than do white females in the 
Southern schools (2 = 4.15, р < .001). 

On the semantic-differential ratings of the four concepts Negro, white, 
integration, and segregation, ¢ tests reveal no significant differences between 


TABLE 2 
ANALYSIS OF VARIANCE FOR THE BOGARDUS SCALE 
Source of variation df Mean square F 
2.03 1.64 

S 
Race 62.95 50.77* 
Interaction 

Sex-by-race 3.02 244 
Within 251 1.24 

+ 5 « 01. 


2 
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white students of the Northern school and white students of the Southern 

schools; consequently, the semantic differential data of white students were 

combined for analysis. A two-by-two analysis of variance, race by sex, was 

computed for each of the concepts [see Tables 3 and 4]. Significant differences 
TABLE 3 


ANALYSIS OF VARIANCE FOR THE SEMANTIC-DIFFERENTIAL 
Concerts “NEGRO” AND “WHITE” 


Concept 
Negro White 
Source of Mean Mean 
variation df square Р square F 
Sex 1 49 .33 1.04 
Касе 1 25.61 68.54% .03 
Interaction 
Sex-by-race 1 3.78 10.16* 4.03 12.67* 
Within 253 37 32 
*2< 201. 
ТАВІЕ 4 
ANALYSIS OF VARIANCE FOR THE SEMANTIC-DIFFERENTIAL 
Concepts “SEGREGATION” AND “INTEGRATION” 
Concept 
Segregation Integration 
Source of Mean Mean 
Variation df square F square F 
Sex 1 .39 40 
Касе 1 34.31 39.39" 62.36 103.07 
Interaction 
Sex-by-race 1 20 1.83 
Within 253 87 60 
* $ S01, 


between races were found for the ratings of Negro, segregation, and integra- 
tion—with Negro students being more favorable toward the concepts Negro 
and integration and less favorable toward segregation than were whites, Negro 
Students are more favorable toward the concept “White” than are white 
Students toward the concept Negro. On all four concepts, no significant dif- 
ferences were found between sexes; however, significant interactions of sex 
by race were found for the concepts Negro and White. 

To ascertain the relationship between the Bogardus social-distance scale 
and the semantic-differential ratings, Pearsonian correlations were computed 
[see Table 5]. In general, the Bogardus scale relates positively with semantic 
Tatings of segregation and negatively with ratings of integration: i.e., high 
Social distance, favorable evaluation of segregation and less favorable evalu- 
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TABLE 5 
INTERCORRELATIONS OF BOGARDUS SCALE AND SEMANTIC-DIFFERENTIAL RATINGS 


Semantic-differential concept 


Subjects Segre- Inte- 
and sex N White Negro gation gration 
Negro 
Male 61 ges A7 .29* —32* 
Female 18 .30 31 .23* —.06 
Southern white 
Male 59 .32* —.23 .64%% —.48** 
Female 60 06 —.22 .51%% —42%% 
Northern white 
Male 35 .09 —.10 317° —:17 
Female 24 51** —.52* 2999 —.58** 
* < .05. 
** 5 « .01. 


ation of integration. No consistent, significant correlations were found between 
the Bogardus scale and semantic ratings of the concepts Negro and White. 

Table 6 gives the intercorrelations of semantic-differential scales and shows 
significant relationships between the concepts Negro and White and between 
the concepts integration and segregation (the only nonsignificant r is for 
Negro females, a group of 18 subjects). Ratings of Negro students for the 


TABLE 6 


INTERCORRELATIONS OF RATINGS ON SEMANTIC-DIFFERENTIAL SCALE 


Concept and 


Concept 


subject group N White Segregation Integration 
Negro (concept) 
White (subjects) 
Male 94 25% —.48** 5599 
Female 84 .8** —.06 33ee 
Negro (subjects) 
Male 61 43** —.04 29° 
Female 18 49% —34 —.11 
White (concept) 
White (subjects) 
Male 94 05 1$8* 
Female 84 17 0+ 
Negro (subjects) 
Male 61 .21* 22% 
Female 18 .00 42** 
Segregation (concept) 
White (subjects) 
Male 94 —.59*#* 
Female . 84 —J1** 
Negro (subjects) 
Male 61 —A4** 
Female 18 -00 
* p< 05. 
+» $c 01. 
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concept White and white students for, the concept Negro related significantly 
and positively with the ratings of the concept integration. While correlations 
are significant, it should be noted that they are relatively low. 


D. Discussion 


In general, the Bogardus and semantic-differential findings suggest that 
Negro-white prejudicial attitudes contain differing elements of ethnic ac- 
ceptance. The expected differences between Northern and Southern college 
Student ratings were significant only in the case of the Bogardus social- 
distance scale. That significance appears to be due primarily to the finding 
that females in the Northern school showed less social distance toward the 
Negro than females in the Southern school. The semantic-differential ratings 
of the white students show no significant differences in regard to either region 
or sex. The Negro students, however, were considerably more emphatic in 
their favorable evaluation of integration and disfavor of segregation than 
were the white students. 

In the main, the use of evaluative-differential scales appears to give a more 
definitive picture of underlying attitudes than do relatively simple measures 
of prejudice. The use of the semantic differential seems to be appropriate for 
à study of prejudice because of the consistent, significant relationships between 
Certain of the concepts and ratings on the Bogardus scale. The present study 
Used semantic-differential scales composed primarily of adjectives high on the 
evaluative factor. Further research may be pointed toward the use of total 
Semantic space and the relationship of activity, potency, and evaluative factors 
as discussed by Osgood eż al. (8). 

The present study has documented meaningful differences between attitudes 
of white and Negro students in regard to prejudice; however, one should 
Note certain limitations that pose problems in regard to the generalization of 
the results, First, the number of students tested was small, especially the 
8toup of Negro females (N = 18). Second, the schools might be quite dif- 
ferent in regard to the kinds of students that would be in attendance. The 
Northern school was an integrated, large, state-supported University. The 
Southern white schools were a state-supported college and a private church- 
related University. The Negro colleges were private, church-related schools, 
Finally, the generalization of results to persons other than college students 
might be inappropriate. The attitudes of college students, both Negro and 
White, in à large Southern urban area, which has witnessed relatively progres. 
Sive increased Negro-white activity can be expected to be different from the 
attitudes of Southern adults or the attitudes of Southern college students in 
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rural areas or other communities. It will be important to assess other samples 
as well as to identify changes in prejudice, if any occur, with increased civil- 
rights activities and desegregation of schools. It would also be interesting to 
assess Negro students who are attending Northern schools in order to compare 
their evaluations with those of Negro students in the deep South. 


E. SuMMARY 


A modified Bogardus social-distance scale and semantic-differential scales 
for the concepts Negro, White, integration and segregation were administered 
to 79 Negro students and 119 white students attending colleges in the South 
and 59 white students attending a large University in the North. Differences 
in ratings made by the white students of the Northern and Southern schools 
were significant only in the case of the Bogardus scale. Significant differences 
between races were found for the semantic-differential ratings of Negro, 
segregation and integration—with Negro students being more favorable 
toward the concepts Negro and integration and less favorable toward segrega- 
tion than were whites. Negro students were more favorable toward the 
concept White than were white students toward the concept Negro. Signifi- 
cant correlations were found between the Bogardus social-distance and 
semantic-differential ratings of integration and segregation but not with 
ratings of Negro and White. 
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STUDIES IN SOCIAL ATTITUDES: I. VARIATIONS IN 
LATITUDES OF ACCEPTANCE AND REJECTION 
AS A FUNCTION OF VARYING POSITIONS 
ON A CONTROVERSIAL SOCIAL ISSUE* 


Department of Psychology, American University of Beirut 


Lutry N. Dian! 


А. INTRODUCTION 


"The terms latitude of acceptance and latitude of rejection have been recently 
introduced in the social-psychological literature mainly as a result of Sherif's 
and Hovland's researches and publications (5, 6, 7, 8). "Latitude of accep- 
tance" is defined as consisting of "those positions on an issue that the individual 
finds most acceptable, plus other positions also acceptable to him" (5, p. 204); 
While “latitude of rejection” consists of “those positions on the same issue 
that he finds most objectionable, plus other objectionable Positions on the 
issue” (5, pps. 204-205). Sherif proposes that the use of the foregoing con- 
Cepts represents the individual’s stand on an issue much more meaningfully 
than the usual single score or average yielded by most of the conventional 
attitude scales (5, 7). Consequently, a number of studies have been under- 
taken (e.g., 1, 2, 4) for the assessment of latitudes of acceptance and rejection, 
Pointing the usefulness of such a conceptualization of social attitudes in 
Studying effects of communication and attitude change. The main purpose 
of the present study is to retest the following two hypotheses: 

Hypothesis 1. The latitude of rejection of individuals committed to an 
extreme stand on an ego-involving issue is greater than the latitude of 
rejection of individuals with a moderate position on the same issue, 

Hypothesis 2. The latitude of rejection of individuals committed to an 
extreme stand on an ego-involving issue will be relatively greater than 
their latitude of acceptance (5, p. 205). 


In addition to retesting the foregoing hypotheses with a different contro- 
Versial social issue (namely, Arab unity) in a different culture, the present 


Study investigates two additional points: 
gs ttn 
* Received in the Editorial Office, Provincetown, Massachusetts, on May 25, 1964, 
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Hovland (1,7) and as will be explained in the section on procedure. 
2. The study of latitudes of acceptance and rejection of individuals on 
more than one dimension of the same issue. 
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1. Further refinements in the measurement technique for the assessment 

of latitudes of acceptance and rejection, as suggested by Sherif and 
The foregoing points will enable us to test our third hypothesis, which 


may be stated as follows: 


Hypothesis 3. Individuals holding an extreme stand on an ego-involving 
issue show greater consistency among various dimensions of their attitude 


toward the issue than do individuals holding moderate positions on the 
same issue. 


B. METHOD 


at the American University of Beirut. The median and mean ages for the 
total sample were 20 and 20% years respectively. Of this sample, 198 were 
males and 62 were females. In terms of religious membership there were 136 
Moslems, 111 Christians, and the remaining 13 subjects either indicated no 
religious affiliation or else belonged to a religious sect known as Druze. As 
to nationality, 124 were Lebanese and 94 were Jordanians or Palestinians; 


the remaining 42 belonged to other Arab countries: mainly, Syria, 


We 

1. Subjects 

The sample used in this study consisted of 260 undergraduate Arab students 
Sudan and 

Traq. 


2. Procedure 


Data pertinent to the hypotheses and Points were obtained from the sub- 
jects’ responses to a questionnaire on “Arab unity,” 
and controversial social issue in the Arab World2 Th 


questionnaire took place in regular classroom sessions 
15 minutes, 


a highly ego-involving 
е administration of the 


and lasted approximately 
Other than securing background information on the subjects, 
the questionnaire consisted of two sets of nine statements each. One set was 


relevant to the good-bad dimension of Arab unity; the other represented 
the possible-impossible dimension of the same issue. The nine statements for 
each set were chosen after preliminary tests on 30 individuals known to the | 
experimenter to represent different stands on this issue. Thus one set of nine ү 
statements represented a continuum of clearly differentiated Positions on one 
dimension (the evaluative dimension) ; while the other set of nine statements / 
represented a continuum of different positions on the second dimension (the 


? Data for this study were collected during the period extending th the 
months of January, February, and March, 1963. g through 
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possibility dimension). For each dimension, the complete set of nine state- 
ments was mimeographed on each of four sheets of paper. 
On the first sheet, subjects indicated the statement “most acceptable” 

to them; on the second, they indicated other statements also acceptable to 

them. The positions checked on these two sheets provided the data on 

their latitudes of acceptance. On the third sheet, subjects checked the 

statement “most objectionable” to them, and on the last sheet they checked 

other objectionable statements. The checkings on these last two pages 

yielded data on their latitudes of rejection (5, p. 206). 


The above procedure and instructions were repeated for both sets of nine 
statements. The middle and extreme statements on the “evaluative” dimension 
are reproduced below: 

А. The achievement of Arab unity is absolutely essential from all 


angles in the Arab people’s interests. 
Е. From the point of view of the Arab people’s interests, it is hard to 


decide whether Arab unity would be advantageous or disadvantageous. 
1. The achievement of Arab unity is completely detrimental from all 


angles in the Arab people’s interests. 
Similar statements on the “possibility” dimension are also reproduced below: 


A. There is no doubt whatsoever that Arab unity will definitely be 


achieved in the future. : 
E. It is hard to tell whether Arab unity will or will not be achieved 


in the future. 
I. There is no doubt 
at any time in the future. 

Though the procedure for testing the two main hypotheses of this study 
is essentially the same as that followed by Sherif and Hovland (5, 7), it 
differs from theirs in that the hypotheses were tested on more than one 
dimension of the same issue (the “evaluative” and the "possibility" dimensions 
of Arab unity) and in trying out a refinement to the Sherif-Hovland measure- 
ment technique. This refinement in measurement consisted of asking the 
subjects to indicate how strongly they felt about every statement they checked 
in any way (i.e., most acceptable," “а]зо acceptable,” “most objectionable,” 
and “also objectionable” statements), by placing before the statement one of 
the following: VS (Very strongly), 5 (Strongly), or M(Mildly). Such а 
modification of the Sherif-Hovland technique was intended (a) to help reveal 
individuals strongly committed to a moderate or neutral stand and (b) to 
differentiate further between individuals with apparently similar extreme 
stands. Such fine differentiations among various positions on the issue could 
not be made with the original Sherif-Hovland technique. 


whatsoever that Arab unity will never be achieved 


286 JOURNAL OF SOCIAL PSYCHOLOGY 


C. RESULTS 


The main results are summarized in Tables 1 through 5. Subjects were 
classified into extreme pros (those selecting statements A, B, or C as “most 
acceptable" to them), moderates (those selecting statements D, E, or F as 
"most acceptable" to them), and extreme anti-Arab unity subjects (those 
selecting statements С, Н, or I as “most acceptable" to them). As shown in 
Table 1, all three subsamples of subjects reject significantly more items or 


TABLE 1 
DIFFERENCES BETWEEN MEAN NUMBER OF “OBJECTIONABLE” AND “ACCEPTABLE” 
STATEMENTS TO SUBJECTS WITH EXTREME AND MODERATE 
POSITIONS ON THE ARAB-UNITY ISSUE 


a b 
Mean Mean 
number number 
Subject's of items of items a-b 
position N objectionable acceptable Difference t 
Evaluative dimension 
Extreme pro 179 3.46 2.70 0.76 7.465%% 
Moderate 53 2.60 2.26 0.34 2.897%% 
Extreme anti 24 3.21 2.54 0.67 2.320* 
Possibility dimension 
Extreme pro 193 3.73 2.31 1.42 11.455*% 
Moderate 43 2.41 2.46 —0.05 0.298 
Extreme anti 20 2.75 2.55 0.25 0.689 


* Significant at the .05 level. 
** Significant at the .01 level. 


statements than they accept on the evaluative dimension. Further, there are 
no significant differences between the mean number of statements rejected by 
these subsamples on the evaluative dimension. These findings, with the ex- 
ception of those pertaining to moderate subjects, are consistent with the first 
two hypotheses. As to the possibility dimension, even though all of the dif- 
ferences in means were in the expected directions, only the extreme pro-Arab 
unity subjects rejected significantly more items or statements than they 
accepted [see Table 1] and, in addition, they rejected a significantly greater 
number of statements than did the moderate (= 4.98, р < :01) and the 
extreme anti-Arab unity subjects (7 = 2.15, p < .05), with these last twe 
groups not differing significantly in this respect. These findings on the possi- 
bility dimension, with the exception of those pertaining to the extreme anti- 
Arab unity subjects, were in line with expectations. 

To get further clarification of the above results concerning an individual's 
stand and variations in his latitudes of acceptance and rejection, the data were 
regrouped as shown in Tables 2 and 3 by classifying the subjects not only in 


LUTFY N. DIAB 287 


TABLE 2 
DIFFERENCES BETWEEN MEAN NUMBER OF “OBJECTIONABLE” AND 
“ACCEPTABLE” STATEMENTS FOR EXTREME AND MODERATE 
SUBJECTS ON THE ARAB-UNITY IssuE 


a 
Mean 
number Mean 
of items number a-b 
Subject’s position and objection- of items Differ- 
degree of acceptance N able acceptable ence 1 
Evaluative dimension 
/ Extreme pro-Arab unity 
| Very strong and strong 174 3.71 2.72 0.99 7.417% 
Mild 5 3.00 2.20 0.80 1.000 
} Moderate 
TON Very strong and strong 44 2.75 2.27 0.48 3.878% 
d Mild 9 1.88 2.22 —0.34 1.417 
Possibility dimension 
| Extreme pro-Arab unity 
| Very strong and strong 185 3.79 2.32 1.47 11.579* 
Mild 8 2.25 2.00 0.25 0.798 
Moderate 
Very strong and strong " 31 245 2.54 —0.09 0.485 
| Mild 12 2.33 2.25 0.08 0.364 
* Significant at the .01 level. 
TABLE 3 


DIFFERENCE BETWEEN MEAN NUMBER OF “OBJECTIONABLE” STATEMENTS By 
EXTREME AND MODERATE SUBJECTS ON THE ARAB-UNITY ISSUE 


Subject's position Evaluative Possibility 
on.Arab unity dimension imension 
“^7 Extreme pro (very strong 
| and strong) 174 3.71 185 3.79 
Extreme pro (mild) 5 3.00 8 2.25 
Difference 0.71 1.54 
t 1.09 247* 
Moderate (very strong 
and strong) 44 2.75 31 2.45 
oderate (mild) 9 1.88 12 2.33 
Difference 0.67 0.12 
t 2.70* 0.51 
* $7.1, 
' terms of the statement or position “most acceptable" to them but also in 
e terms of the degree to which they accept their “most acceptable” position 
I (i.e, very strong, strong, or mild acceptance of the “most acceptable" pogi. 
\ tion). It will be seen from Table 2 that the extreme pro-Arab unity subjects 


who expressed “mild” acceptance of their stand did not reject significantly 
more items than they accept, a phenomenon that holds true for both dimen- 
sions. Also, while the moderates who expressed “mild” acceptance of their 
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stand on the evaluative dimension accepted more items than they rejected, 
the moderates who expressed "very strong" or "strong" acceptance of their 
moderate stand rejected significantly more items than they accepted. This 
latter finding concerning the moderate subjects on the evaluative dimension, 
though not in line with expectations, becomes understandable in the light of 
strength of commitment to the moderate stand. On the other hand, the 
moderates on the possibility dimension, as expected, did not show significant 
differences between their latitudes of acceptance and rejection, irrespective of 
the degree with which they upheld their stand [see Table 2]. Moreover, as 
shown in Table 3, it is clear that on the evaluative dimension the moderate 
subjects who felt “very strongly” or “strongly” about their position on the 
issue rejected significantly more items than the moderates who expressed 
“mild” acceptance of their position, a finding which does not apply to the 
moderates on the possibility dimension (though the differences were in the 
expected direction). Also, the extreme pro-Arab unity subjects who felt "very 
strongly” or “strongly” about their position on the possibility dimension re- 
jected significantly more items than the extreme pro subjects who expressed 
“mild” acceptance of their position, a finding which does not apply to the 
extreme pro-Arab unity subjects on the evaluative dimension, even though 
the differences were in the expected directions [see Table 3]. Thus, it may be 
stated here that the degree of commitment to a stand may be in itself an 
important factor affecting latitudes of acceptance and rejection. 

To test the third hypothesis of this study (namely, that subjects with 
extreme stands on an issue show greater consistency among various dimensions 
of their attitude than do moderate subjects) Table 4 was constructed. This 
Table shows that, of the total number of extreme pro subjects on the evalu- 
ative dimension, about 94 per cent are also extreme pro on the possibility 
dimension ; however about one-half of the moderates and half of the extreme 
anti-Arab unity subjects on the evaluative dimension are also, respectively, 
moderates and extreme anti-Arab unity on the possibility dimension. A chi- 

TABLE 4 
NUMBER AND PER CENT OF SUBJECTS CHECKING VARIOUS POSITIONS 


AS “Most ACCEPTABLE" ON THE EVALUATIVE DIMENSION 
OF THE ARAB-UNITY ISSUE 


Evaluative dimension 


Possibility Extreme pro Moderate Extreme anti 
Emu N % N % i % 
168 93.86 22 41.50 3 12.50 


Extreme pro 


9 5.03 25 47.18 9 37.50 

: ë 

Modetas © жй 2 111 6 11.32 12 50.00 
кез) 179 100.00 53 100.00 24 100.00 
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square test based on the frequencies in Table 4 yields a value of 144.530, 
significant beyond the .001 level (for four degrees of freedom) and implies 
that the evaluative and possibility dimensions relevant to the Arab unity issue 
are not independent of each other. In other words, the stand of a subject on 
one dimension is likely to be the same on the other dimension. This is parti- 
cularly the case for the extreme pro subjects, less so for the extreme anti- 
Arab unity subjects, and least for the moderates. These findings give general 
support to the third hypothesis of this study, though it is clear that extreme 
anti-Arab unity subjects do not differ significantly from moderate subjects. It 
seems that the greatest consistency between both dimensions holds true mainly 
for the extreme pro-Arab unity subjects. 

As stated earlier in the procedure section, subjects were asked to indicate 
how strongly they feel about the various statements they accept or reject. 
Tabulation of the subjects’ responses in terms of various degrees of acceptance 
for the “most acceptable” position is presented in Table 5. For the evaluative 
dimension, about 97 per cent of the extreme pro and all of the extreme anti- 
Arab unity subjects stated that they felt either “very strongly” or “strongly” 
about the positions they checked as “most acceptable,” which are Positions A, 
B, and C and G, H, and I respectively. However, the moderate subjects also 
expressed similar feelings on the evaluative dimension, for about 83 per cent 
of them stated that they felt either “very strongly” or “strongly” about the 
positions they chose a8 “most acceptable" to them (i.e., Positions D, E, or F), 
and only about 17 per cent of them felt “mildly” about their “most acceptable” 
Position, A chi-square test based on the frequencies shown in Table 5 for the 
evaluative dimension yields a value of 39.737, which is significant beyond 
the .001 level (for four degrees of freedom). Partitioning this chi-square 
value (3) shows that the extreme pro-Arab unity subjects, in comparison to 
the moderates, check either “very strongly” or “strongly” significantly more 
than they check “mildly” to represent the strength of their feeling about their 
“most acceptable” position (x? = 15.920, significant beyond the .001 level 
for one degree of freedom). Also, the extreme anti-Arab unity subjects, in 
contrast to moderate subjects, check either “very strongly” or “strongly” 
significantly more than they check “mildly” to indicate the strength of their 
feeling about their “most acceptable” position (x? = 9.200, significant beyond 
the .01 level for one degree of freedom). However, as expected, the extreme 
Pro and the extreme anti-Arab unity subjects do not differ significantly in the 
degree with which they accept their most acceptable” positions (x? = 0.318, 
not significant for one degree of freedom). 

As to the possibility dimension, it is clear from Table 5 that about 96 per 
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TABLE 5 РА 

NUMBER AND Рев CENT ОЕ SUBJECTS CHECKING Various POSITIONS As "Mosr ACCEPTABLE" ON THE ARAB-UNITY 15808 B 
SHOWING VARIOUS DEGREES OF ACCEPTABILITY 

o 

Evaluative dimension Possibility dimension "i 

Degree of Extreme pro Moderate Extreme anti Extreme pro Moderate Extreme anti а 

acceptability N % N % N % N % N % N % б 

Уегу strongly 137 76.54 18 33.96 17 70.83 122 63.22 18 41.87 14 70.00 a 

Strongly 37 20.67 26 49.05 7 29.17 63 32.64 13 30.23 5 25.00 b 

Mildly 5 2.79 9 16.99 0 0.00 8 4.14 12 27.90 1 5.00 K 

Total 179 100.00 53 100.00 24 100.00 193 100.00 43 100.00 20 100.00 Z 
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cent of the extreme pro and 95 per cent of the extreme anti-Arab unity 
subjects stated that they felt either “very strongly” or “strongly” about 
the positions they checked as “most acceptable” to them. However, only about 
72 per cent of the moderate subjects felt the same way about their own “most 
acceptable” position, and the remaining 28 per cent of moderates stated that 
they felt “mildly.” A chi-square test based on the frequencies presented in 
Table 5 for the possibility dimension yields a value of 27.855, which is signifi- 
cant beyond the .001 level (for four degrees of freedom). Partitioning this 
chi-square value (3) shows that the extreme pro-Arab unity subjects, in 
Contrast to the moderates, check either “very strongly” or "strongly" signi- 
ficantly more than they check "mildly" to indicate the strength of their feeling 
about their “most acceptable" position (x? = 26.340, significant beyond the 
:001 level for one degree of freedom). Also, the extreme anti-Arab unity 
subjects, in comparison to the moderates, check “very strongly” or “strongly” 
significantly more than they check “mildly” to indicate the strength of their 
feeling about their stand on the issue (x? = 9.509, significant beyond the 
:01 level for one degree of freedom). However, as expected, the extreme pro 
and the extreme anti-Arab unity subjects on the possibility dimension do not 
differ significantly in the degree with which they accept their stand 
(x? = 0.175, not significant for one degree of freedom). 


D. Discussion 


The results of this study may best be understood by comparing them with, 
Previous findings on which they were based, mainly represented by Sherif's 
and Hovland's work (1, 5, 6, 7). In line with the findings of these two 
authors, it was found in this study that individuals upholding an extreme 
favorable stand on an issue have latitudes of rejection. greater than the 
latitude of rejection of subjects upholding moderate positions on the same 
issue, though the difference is significant only on the possibility dimension. 
Thus, it is clear that these results pertaining to subjects with an extremely 
favorable stand support the first hypothesis of this study and confirm previous 
findings by Hovland and Sherif (1, 7). On the other hand, the latitude of 
rejection of the extremely anti-Arab unity subjects is also greater than that 
of the moderates; though the difference is not significant on either of the 
two dimensions. Thus, the results pertaining to subjects with an extremely 
unfavorable stand also support, though not significantly, the first hypothesis 
of this study and are in line with previous findings. 

However, in line with the first hypothesis, the extremely pro and the 
extremely anti-Arab unity subjects were not expected to differ significantly 
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from each other in their latitudes of rejection. Our results show that, on the 
possibility dimension, the mean number of items objectionable to the extreme 
pro subjects is significantly greater than the mean number of items objection- 
able to the extreme anti-Arab unity subjects. Though this finding is not in 
line with the implications of the first hypothesis, it may be better understood 
when considered in relation to the nature of the particular social issue under 
consideration: namely, the issue of Arab unity. It may be pointed out here 
that the operation of certain reality checks have caused the extremely anti- 
Arab unity subjects to be reluctant in rejecting the possibility of unity among 
the Arab countries. For one thing, Arab unity materialized in 1958 when 
the United Arab Republic (U.A.R.) was created. Though this unity did not 
last till the time of this study, the present data were collected at a time when 
in Cairo talks for Arab unity and the formation of a new U.A.R. were about 
to be held among Egypt, Iraq, and Syria. Thus, though the extremely anti- 
Arab unity subjects expressed through the positions they chose as “most ac- 
ceptable” to them (1.е., Positions G, H, or I) that they felt Arab unity was 
impossible to achieve, yet they chose to remain noncommittal on a relatively 
large number of the remaining positions on the issue. It appears that subjects 
who take an extremely opposed stand to Arab unity seem to be more reluctant 
and unsure of the practicality of their position, when compared to subjects 
who show an extremely favorable stand. Or perhaps one may speculate in 
this connection about the possibility of different effects produced by positive 
and negative ego involvements and resulting also from the reality checks 
pertaining to the issue at hand. Or, finally, one may raise a question as to 
whether the most-extreme end positions on the issue of Arab unity (i.e., 
Positions A and I) are equally extreme. That is, could it be that Position I 
is not as extreme as Position A? Further research, utilizing larger samples 
of subjects, is needed in order to understand more fully the factors operating 
to produce these differences between the extreme pro and the extreme anti- 
Arab unity subjects. 

Our results seem also to give clear support to our second hypothesis: 
namely, that individuals with extreme stands on an ego-involving issue have 
latitudes of rejection greater than their own latitudes of acceptance. Again, 
in line with Sherif's and Hovland's findings (1, 5, 7), it was found that 
individuals upholding an extreme favorable or unfavorable stand on the evalu- 
ative dimension of the Arab-unity issue tend to reject more positions than 
they accept [see Table 1]. However, on the possibility dimension of the same 
issue, this finding held true mainly for subjects with an extremely favorable 
stand (at Positions A, B, and C) and only partially for subjects with an 
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extremely unfavorable stand [see Table 1]. As shown in Table 1, subjects 
with extremely favorable as well as unfavorable stands reject significantly 
more items than they accept on the evaluative dimension. However, on the 
possibility dimension, though all the differences are in the expected directions, 
only those subjects with an extremely favorable stand rejected significantly 
more items than they accepted [see Table 1]. Here again is another instance 
of an unexpected difference between subjects with extremely favorable and 
those with extremely unfavorable stands toward Arab unity, resulting perhaps 
from different kinds of ego involvement or from the compelling demands 
of reality and suggesting the need for further research to clarify these dis- 
crepancies. 

As a corollary of our second hypothesis, it was expected that subjects with 
moderate stands on the issue of Arab unity should not evidence significant 
differences between their latitudes of acceptance and their latitudes of Tejec- 
tion. As Table 1 shows, such an expectancy held true on the possibility 
dimension but not on the evaluative dimension. In fact, on the evaluative 
dimension, moderate subjects rejected significantly more items than they 
accepted [see Table 1], thus showing similar behavior to subjects with ex- 
treme stands, As stated earlier, though this finding is not in line with our 
expectations, it becomes understandable in the light of the subject’s strength 
of commitment to the moderate stand. As Sherif and Hovland remark, "provi- 
sion must also be made, however, for the occasional individual who strongly 
upholds a position in the middle segments and rejects positions at the ex- 
tremes” (7, p. 190). The results of the breakdown of the sample in terms 
of the subjects’ stand as well as the degree with which they are committed 
to that stand, presented in Tables 3 and 4, lead us to conclude that the 
degree with which a subject upholds his stand, regardless of whether it is a 
moderate or an extreme stand [see Tables 3 and 4], is in itself an important 
factor determining the relative sizes of his latitudes of acceptance and rejec- 
tion, 

Finally, our results tended to support our third hypothesis: namely, that 
subjects with extreme stands showed greater consistency among their attitude 
components than did moderate subjects [see Table 5]. However, the greatest 
Consistency was exhibited by the extreme pro subjects, while the extreme 
anti-Arab unity and the moderate subjects did not differ appreciably from 
each other, Again, we evidence here the irregular behavior of those subjects 
that have an extremely unfavorable stand towards Arab unity, in contrast to 
the consistent behavior of the subjects who have an extremely favorable stand. 
It is obvious here that while the evaluative and the possibility dimensions of 
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Arab unity appear to be strongly related when subjects with an extremely 
favorable stand are considered, the relationship tends to weaken considerably 
in the case of moderate as well as extreme anti-Arab unity subjects. This 
relative lack of consistency in the case of subjects with extremely unfavorable 
stands, in comparison to that of subjects with extremely favorable stands, may 
be due to a variety of factors. For one thing, subjects with an extremely un- 
favorable stand towards Arab unity may either feel that Arab unity has some 
advantages though it appears impossible to achieve or they may feel that it 
is very bad but yet has some chances of being achieved. In other words, it 
seems that the relative lack of consistency in the case of the extreme anti-Arab 
unity subjects may be the result of their extreme stand on one dimension 
and, at the same time, their less extreme stand (i.e., upholding a moderate or 
even an extreme pro position) on the other dimension. In either case, it 
seems that such relatively inconsistent behavior is caused by the fact that 
extreme negative stand on both dimensions may not be very realistic. On the 
other hand, the extremely consistent behavior of individuals upholding an 
extremely favorable stand towards Arab unity may be, at the same time, due 
to blindness to and denial of reality on the part of these subjects, resulting 
from extreme positive ego involvement. As suggested earlier, there is the pos- 
sibility of different effects produced by positive and negative ego involvements 
on an issue. However, the need for further research on these and on related 
points is indicated by our results. 


E. SUMMARY 


The purpose of the present study was to investigate variations in the 
latitudes of acceptance and rejection as a function of varying positions on 
a controversial social issue: namely, the issue of Arab unity. 

The procedure used for testing the hypotheses of this study involved the 
administration of a questionnaire which was essentially similar to that used 
by Sherif and Hovland (5, 7). The sample consisted of 260 undergraduate 
Arab students at the American University of Beirut. 

The study showed, in line with previous findings by Sherif and Hovland, 
(a) that the latitude of rejection of subjects upholding extreme stands is 
greater than the latitude of rejection of subjects upholding moderate stands 
and (b) that the latitude of rejection of subjects upholding extreme stands is 
greater than their own latitude of acceptance. However, the results pertaining 
to subjects upholding an extremely negative stand towards Arab unity were 
not fully in line with expectations. The explanation given to account for 
this discrepancy was based on reality demands relevant to the issue under 
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consideration. Furthermore, the results pertaining to subjects upholding a 
moderate stand also were not fully in line with our hypotheses. Such a dis- 
crepancy was mainly accounted for by taking the subject’s strength of com- 
mitment to his moderate stand into consideration. 

Finally, our results have shown that subjects with extreme stands exhibit 
greater consistency among their attitudinal dimensions than do moderate 
subjects. However, subjects upholding an extremely favorable stand towards 
Arab unity showed a much greater consistency than did subjects with an 
extremely unfavorable stand. This discrepancy was explained in terms of the 
possible differential effects of positive and negative ego involvements in rela- 
tion to the social issue in question and to the operation of reality checks, 

The need for further research, utilizing larger samples of subjects, is 
indicated. This is particularly essential in relation to findings on subjects 
upholding extremely negative stands toward Arab unity. 
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А. INTRODUCTION 


A number of studies (1, 2, 4, 5) have shown that individuals upholding 
Strong stands on an issue will have a“... raised threshold of acceptance and 
lowered threshold of rejection" (4, p. 202). 

In line with the foregoing studies, the author speculated that an individual's 
stand on an issue should determine the kinds of mass communication media to 
which he exposes himself; hence this study is an attempt to provide an answer 
to the question that may be presented in the form of an hypothesis as follows: 
Individuals upholding an extreme positive or negative stand on an ego-in- 
volving issue are selective in the kinds of mass communication media to which 
they expose themselves, the selectivity being in the direction of media repre- 
senting their own points of view to the exclusion of media expressing opposite 
Points of view. 

B. METHOD 
l. Subjects 


The same sample used in a recent study by the author (1) was utilized in 
this study. It consisted of 260 undergraduate Arab students at the American 
University of Beirut (198 males and 62 females) with a mean age of 20% 
Years. One hundred thirty-six were Moslems, and 111 were Christians; the 
Temainder indicated no religious preference or belonged to a religious sect 
known as Druze. One hundred twenty-four were Lebanese and 94 were 
Jordanians or Palestinians; the remainder, mainly, were Syrians, Sudanese, 
and Iraqis. 

2. Procedure 

The questionnaire utilized in an earlier study by the author (1) contained 
questions relevant to the hypothesis of the present study. These questions asked 
the subject to indicate by name (a) the newspapers he reads or prefers to 
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read often and (5) the radio stations he listens to or prefers to listen to 
often. The questionnaire was entitled “A Questionnaire on Arab Unity” 
and was given in regular classroom sessions. It took approximately 15 minutes. 
As reported earlier (1), subjects were asked to express their stand on the 
good-bad (evaluative) dimension of Arab unity, by checking one out of nine 
statements (A to I) “most acceptable.” Subjects checking A, B, or C posi- 
tions (or statements) were classified as extreme pro-Arab unity; those checking 
D, E, or F were considered moderates; finally, subjects checking G, H, or I 
positions as “most acceptable” were classified as extreme anti-Arab unity. 

Prior to collecting and tabulating the subjects’ responses, a categorization 
of newspapers and radio stations available in the Arab Middle East area was 
prepared. This categorization was based on majority agreement among 10 
judges, who were asked to place each of many mass media in one of the 
following categories: I. Pro-Arab unity newspapers and radio stations, II. 
“Міхей” newspapers and radio stations (i.e., neutral or foreign), III. Anti- 
Arab unity newspapers and radio stations. 'The criterion used for the place- 
ment of the media in any one of the categories was the placement of that 
medium in one category by six or more of the judges. The 10 judges (univer- 
sity professors, journalists, and graduate students) represented different posi- 
tions on the issue of Arab unity. 


C. RESULTS 


The main results are presented in Tables 1 and 2. Table 1 shows that the 
results pertaining to the "reading" behavior of the extreme pro-Arab unity 
subjects and, particularly, the extreme anti-Arab unity subjects seem to give 
strong support to the hypothesis of this study. Though moderate subjects were 
not specifically referred to in our hypothesis, it was implied that they would 
show (in comparison to extreme subjects) less selectivity in the kinds of mass 
communication media to which they expose themselves; however, it appears 


TABLE 1 
NUMBER AND PER CENT OF SUBJECTS SHOWING PREFERENCES FOR 
“NEWSPAPERS” EXPRESSING DIFFERENT POINTS OF View 
ON THE ARAB-UNITY ISSUE 


"Newspapers" preferred 


Pro-Arab A Anti-Arab 
Evaluative unity ixe unity 
dimension N N 96 N 96 N % 
Extreme pro-Arab 166 57 34.34 78 46.99 31 18.67 
Moderate 47 5 10.64 17 36.17 25 53.19 


Extreme anti 24 1 4.17 3 12.50 20 83.33 
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TABLE 2 
NUMBER AND PER CENT OF SUBJECTS SHOWING PREFERENCES FOR 
“RADIO STATIONS" EXPRESSING DIFFERENT POINTS OF 
VIEW ON THE ARAB-UNITY ISSUE 


"Radio stations" preferred 


Pro-Arab Anti-Arab 
Evaluative unity Mixed unity 
dimension N N % N % N % 
Extreme pro-Arab 163 92 56.44 60 36.81 11 6.75 
Moderate 49 10 2041 29 59.18 10 20.41 
Extreme anti 22 1 4.54 13 59.10 8 36.36 


that the greatest proportion of moderate subjects (about 53 per cent) prefer 
to read anti-Arab unity papers, while only about 11 per cent read pro-Arab 
unity papers. A chi-square test, based on the frequencies shown in Table 1, 
yields a value of 54.712 (which is significant beyond the .001 level [df = 4]) 
and implies that the kinds of newspapers read by our subjects are not indepen- 
dent of the subjects’ stands on the issue of Arab unity. Partitioning of the 
chi-square value (3) shows that the extreme pro-Arab unity subjects, in 
comparison to the extreme anti-Arab unity subjects, prefer to read pro-Arab 
unity papers significantly more than they read anti-Arab unity papers and 
vice versa (y? = 30.450, significant beyond the .001 level for one degree of 
freedom). 

Table 2 shows that the findings relevant to the "listening" behavior of 
subjects upholding different stands on the Arab-unity issue give strong support 
to our hypothesis; thus, while the greatest proportion of the extreme pro-Arab 
unity subjects (about 56 per cent) prefer to listen often to pro-Arab unity 
radio stations, only about seven per cent listen to anti-Arab unity stations. 
Also, while only about five per cent of the extreme anti-Arab unity subjects 
listen to pro-Arab unity stations, about 36 per cent of them prefer to listen 
to anti-Arab unity stations. Final, in line with the implications of our 
hypothesis, equal proportions of moderate subjects listen to pro-Arab unity 
as well as to anti-Arab unity stations. A chi-square test, based on the fre- 
quencies shown in Table 2, yields a value of 42.284, significant beyond the 
-001 level (df — 4) and implies that the kinds of radio stations peferred by 
Our subjects are not independent of the subject's attitudes towards Arab unity. 
Partitioning this chi-square value (3) shows that the extreme pro-Arab unity 
Subjects, in comparison to extreme anti-Arab unity subjects, prefer to listen 
to pro-Arab unity stations significantly more than they listen to anti-Arab 
unity stations and vice versa (72 = 14.556, significant beyond the .001 level 


for one degree of freedom). 
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D. Discussion 


The results of this study may be better understood if a point concerning 
procedure is clarified and made more explicit. It will be recalled that our 
subjects were asked to write the names of various newspapers and radio 
stations that they like or prefer. "There is no sure way of telling the reasons 
that a subject may have for expressing preference to a particular newspaper 
or radio station, but a subject preferring to read a pro-Arab unity paper 
rather than an anti-Arab unity paper does so, most likely, because of the 
particular political orientation of the paper. However, we cannot be equally 
sure that a subject prefers to listen often to a particular radio station because 
of the latter’s viewpoint on the issue of Arab unity. After all, there is a like- 
lihood that the subject may prefer a certain radio station because it offers 
him the kind and variety of musical programs that he happens to like. Further- 
more, there is no financial expenditure involved in listening to the radio, as 
in obtaining one's favorite newspaper (besides the fact that “reading” involves 
more active effort on the part of the individual than “listening”). In spite 
of the foregoing, one should not infer that the subjects’ preferences for various 
media were not mostly determined by the political coloring of the media, 
particularly in the Arab Middle East. In addition, as mentioned earlier, 
the questionnaire was entitled “A Questionnaire on Arab Unity,” which fact 
should put the subject in this particular context and have his preferences 
expressed mainly in relation to the issue in question. 

In line with our hypothesis, subjects with an extreme positive stand towards 
Arab unity are highly selective in the kinds of radio stations they listen to 
(preferring pro-Arab unity and avoiding anti-Arab unity stations) and they 
also show considerable selectivity in the kinds of newspapers they read, How- 
ever, about 66 per cent of the pro-Arab unity subjects [see Table 1] reported 
that they often read one or more “mixed” as well as anti-Arab unity papers. 
Yet, a further analysis of this result shows that the greatest proportion of these 
subjects who read “mixed” and anti-Arab unity papers (about 60 per cent of 
them) read only one particular newspaper (classified as anti-Arab unity by 
the majority of our 10 judges). This particular paper has special appeal to 
our pro-Arab unity subjects mainly as a result of a daily editorial which it 
includes, often revolving around the issue of Arab unity and the various 
problems that relate to it and which may be characterized as being analytical 
rather than critical of Arab unity. Thus, the apparent lack of strong selec- 
tivity in the kinds of newspapers read by our pro-Arab unity subjects is now 
more understandable in the light of this information. As our results show, 
about one-third of our pro-Arab unity subjects prefer to read pro-Arab unity 
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papers; while about two-thirds of them also reported preference for reading 
all other kinds of papers, including anti-Arab unity papers (of which one 
particular paper has special appeal because of its analytical daily editorial on 
the Arab World and its day-to-day problems). Thus, it appears that about 
two-thirds of our pro-Arab unity subjects are eager to learn also about various 
Points of view other than their own. Being strongly and positively ego involved 
in the issue of Arab unity, the majority of our pro-Arab unity subjects are 
highly motivated to find out what problems stand in the way of achieving 
this unity. It is possible that such a finding would not be applicable to un- 
educated and nonuniversity pro-Arab unity subjects, where much greater 
selectivity in mass media would be expected. Such a point is worth in- 
vestigating. 

In line with our hypothesis, anti-Arab unity subjects show high selectivity 
in the kinds of newspapers they read, preferring anti-Arab unity newspapers 
and almost refusing to read pro-Arab unity papers. However, when it comes 
to radio station preferences, about 59 per cent of the anti-Arab unity sub- 
jects were found to prefer mostly "neutral" and “foreign” radio stations, with 
about 36 per cent reporting that they prefer to listen to anti-Arab unity 
Stations, Thus, there is evidence for selectivity as well as withdrawal in the 
"reading" and "listening" behavior of our anti-Arab unity subjects. "These 
findings about anti-Arab unity subjects may indicate a tendency on their 
Part to run away from or avoid reality, caused by an underlying feeling of 
threat and insecurity that may accompany their minority-group status. In 
other words, we may have here an instance of perceptual defense contributing 
to the preservation and stability of one's already existent attitudes. On the 
other hand, it might be that the anti-Arab unity subjects, in contrast to the 
Pro-Arab unity subjects, do not experience any great need for knowing and 
discovering more about the Arab unity issue. To them, Arab unity is impos- 
sible to achieve or disadvantageous to the Arab countries; hence there is no 
need to know anything more about it. Furthermore, one may add that most 
of the anti-Arab unity subjects seem to obtain their information from anti- 
Arab unity papers and from “neutral” and “foreign” as well as anti-Arab 
unity radio stations. 

Finally, subjects with a moderate stand towards the Arab-unity issue were 
found to behave in a fashion highly similar to the anti-Arab unity subjects, 
Particularly in connection with the kinds of newspapers they prefer to read, 
In fact, it was reported earlier that moderate subjects behaved very much 
like anti-Arab unity subjects in that they rejected significantly more items 
than they accepted and in that the great majority of them rejected the favor- 


302 JOURNAL OF SOCIAL PSYCHOLOGY 


able end of the “evaluative” and "possibility" continua of Arab unity (1). 
Thus, the present finding on moderate subjects, though not fully in line with 
the implications of our hypothesis, confirms earlier findings (1) about the 
similarity in the reactions of moderate and anti-Arab unity subjects. Such 
a finding leads us to doubt the neutral position of so-called moderate subjects, 
as measured by direct questions. 


E. SUMMARY 


'The purpose of the present study was to study selectivity in mass-com- 
munication media as a function of one's stand on a controversial social issue: 
namely, the issue of Arab unity. 

‘The sample used in this study consisted of 260 undergraduate Arab students 
at the American University of Beirut. The subjects were given a questionnaire 
on Arab unity (1) on the basis of which their stand towards this issue was 
determined ; also, subjects were asked to indicate their preferences for various 
newspapers and radio stations available in the Arab Middle East area. 

In general, the results demonstrate that subjects with an extreme stand 
on the issue of Arab unity (particularly those with an extreme negative stand ) 
show high selectivity in the kinds of mass communication media to which 
they expose themselves, the selectivity being in the form of preferences for 
media (i.e., newspapers and radio stations) expressing points of view similar 
to their own. However, subjects with a moderate stand on the issue of Arab 
unity also showed behavior similar to that of the extreme anti-Arab unity 
subjects. "This result raises questions as to the adequacy of the attitude- 
measuring technique that was employed in this and in similar previous studies 
in the assessment of “moderate” or “neutral” attitudes. 
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STUDIES IN SOCIAL ATTITUDES: III. ATTITUDE 
ASSESSMENT THROUGH THE SEMANTIC- 
DIFFERENTIAL TECHNIQUE*: 


Department of Psychology, American University of Beirut 
Lutry N. Dias 


А. INTRODUCTION 


In a recent study reported by the author (1), it was found that subjects 
with a moderate stand on the issue of Arab unity, as measured by the Sherif- 
Hovland technique (6, 7), rejected significantly more items than they ac- 
cepted from a total of nine items (A to I) representing the evaluative (good- 
bad) dimension of Arab unity. Such a finding was expected only for subjects 
With an extreme positive or negative stand toward the issue in question, but 
not for those with a moderate stand. In a follow-up study by the same author 
(2), it was also found that subjects with a moderate stand on the Arab- 
unity issue showed, in their reactions to various kinds of mass-communication 
media, selective behavior similar to that of the extreme anti-Arab unity sub- 
jects. Thus, doubt was then expressed as to the neutrality of so-called 
moderate subjects, as measured by the Sherif-Hovland technique (2, 7). 

The present study attempts to find explanations for the foregoing dis- 
Crepancies, by utilizing the semantic-differential technique which, according 
to its authors (5), should enable one to get a more-complete picture of the 
meaning of "Arab unity" than is obtained through the evaluative dimension 
alone. Furthermore, the use of more than one evaluative scale to represent 
the evaluative dimension should give a more-exact estimate of the subjects’ 
attitude scores. Finally, the use of a variety of semantic-differential scales, 
representing various dimensions of meaning, should clarify the behavior of 
Subjects with extreme positive or negative stands toward the issue of "Arab 
unity," 

According to Osgood, Suci, and Tannenbaum, “. . . to index attitude we 
would use sets of scales which have high loadings on the evaluative factor 
across concepts generally and negligible loadings on other factors . . ." (5, 
P. 191). Thus, to Osgood eż al. attitude is simply identified as the evaluative 
—— 
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dimension of the total semantic space. They also report that in ali ot the 
factor analyses done by them the evaluative dimension accounted "for the 
largest proportion of the total variance” (5, p. 190). Furthermore, in meas- 
uring attitude, they state “. . . we usually include a considerable number of 
scales representing other factors—this is done both to obscure somewhat the 
purpose of the measurement and to provide additional information on the 
meaning of the concept as a whole, aside from the attitude toward it" (5, p. 
191). Thus, our present choice of the semantic differential in foto as а meas- 
uring instrument was based on the evidence presented by Osgood et al., which 
shows that the differential provides “а richer picture of the meaning of the 
attitude object than just the evaluative dimension . . ." (5, p. 195). Osgood 
et al. conclude that "by combining judgments derived from scales represent- 
ing other dimensions with those derived from the evaluative factor alone, 
additional information can be obtained and prediction presumably improved" 
(5, p. 198). In the light of accumulated evidence, Osgood and his colleagues 
report the cross-cultural and cross-linguistic existence of the same three in- 
dependent factors or dimensions that were found with American subjects (3). 
These factors are designated the evaluative, potency, and activity factors of 
meaning. Our present interest was in selecting scales with high loadings on 
the evaluative factor for the Arabic language and in the use of such scales 
(consisting of pairs of adjectival opposites) as a basis for judging the con- 
cept "Arab unity." Such scales, representing the evaluative factor, as well as 
other scales representing other factors in the semantic space for our Arab 
subjects, were available in a mimeographed report by Osgood, Archer, and 
Miron (4); hence the present study will make use of a number of semantic- 
differential scales, representing the evaluative as well as other factors of 
meaning, in an attempt to clarify some of the unexpected findings concerning 
moderate subjects (1, 2) and to throw more light on the meaning of “Arab 
unity" to extreme pro-Arab unity as well as to extreme anti-Arab unity 
subjects. 
B. METHOD 
1. Subjects 


'The same sample used earlier by the author (1, 2) was also utilized in 
this study. It consisted of 260 undergraduate Arab students at the American 
University of Beirut (198 males and 62 females) with a mean age of 20% 
vears for the total sample. As to religion, 136 were Moslems, 111 were 
Christians, and the rest indicated no religion or belonged to the Druze sect. 
In terms of nationality, 124 were Lebanese, 94 were Jordanians or Pales- 
tinians, and the rest were mainly Syrians, Sudanese, and Iraqis. 
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2. Procedure 


' In addition to information obtained previously and mentioned in earlier 
studies (1, 2), the questionnaire administered anonymously to the subjects 
included also a series of 13 different scales against which the concept of 
"Arab unity" was to be judged, following exactly Osgood's procedure and 
instructions (4, 5). Four of these scales have very high loadings on the 
"evaluative" factor and negligible loadings on the other factors (4). "These 
were the good-bad, beneficial-injurious, safe-dangerous, and great-contempt- 
ible scales. Another set of three scales were chosen (4) as representative of 
the “potency” factor. These were the slow-fast, strong-weak, and rational- 
emotional scales. The remaining six scales were selected in such a way as to 
represent the "activity" and other unidentified factors or dimensions, particu- 
larly those that appeared to the author to be most relevant to the issue of 
Arab unity. 
C. RESULTS 

The use of the semantic-differential technique was intended, among other 
things, to provide further clarifications for some of the unexpected findings 
concerning moderate subjects, reached in earlier studies (1, 2). As stated in 
the introduction, subjects with a moderate stand towards Arab unity (those 
Checking Positions D, E, or F as "most acceptable" from a good-bad con- 
tinuum of nine statements—A to I) were found to reject a significantly 
Sreater number of statements than they accepted. Such an unexpected finding 
for moderate subjects was previously accounted for in terms of the strong 
commitment to the moderate stand exhibited by the majority of moderate 
subjects (1). In addition to this explanation, the use of several semantic- 
differential evaluation scales has helped to shed more light on the behavior 
of moderate subjects. As shown in Table 1, our so-called moderate subjects, 
as measured by the Sherif-Hovland technique (1, 6, 7), do not seem to be 
Moderate, as measured by their mean evaluative ratings on four semantic- 
differential evaluative scales. In fact, moderate subjects at Position D have 
a mean evaluative rating of 2.28 on a seven-point scale, which means that 
they feel that Arab unity is mostly "quite good, beneficial, safe, and great.” 
On the other hand, moderate subjects at Position F have а mean evaluative 
Tating of 5.96, which indicates that Arab unity is, teom their point of view, 
= bad, injurious, dangerous, and contemptible.” Only those moderate 
Subjects at Position E turned out to be near the point of neutrality, with a 
mean evaluative rating of 3.81, indicating that Arab unity is to them “equally 
good and bad." Thus, it appears from these results that more than half of 
our so-called moderate subjects (about 62 per cent) are not moderates, but 


ТАВГЕ 1 
MEAN RATINGS OF THE CONCEPT "ARAB UNITY" ON 13 SEMANTIC-DIFFERENTIAL SCALES 


Position checked “Most Acceptable" on evaluative dimension 


A B € D E F G H I 
Dimension (N —99) (N —60) (N —19) (= 25) (N 220) (N 28) (N—6) (N=7) (N=11) 
Evaluative scales 
1. Good-bad 1.12 1.35 1.68 2.04 3.45 5.75 6.00 5.71 6.18 
2. Beneficial-injurious 1.19 145 2.00 1.92 3.65 6.12 5.00 6.43 5.82 
3. Safe-dangerous 1.38 2.10 1.79 3.00 430 6.12 5.33 6.71 5.82 
4. Great-contemptible 1.34 1.58 2.05 2.16 3.85 5.87 5.33 5.85 5.73 
Mean 1.26 1.62 1.88 2.28 3.81 5.96 541 6.17 5.89 
Potency scales 
5. Strong-weak 143 1.81 2.10 2.68 425 5.87 5.16 6.71 5.55 
6. Rational-emotional 2.96 3.43 447 472 5.20 6.87 4.33 7.00 6.00 
7. Fast-slow 4.04 4.30 4.42 5.88 5.85 4.75 6.33 5.86 7.00 
Mean 2.10 3.18 3.66 4.43 5.10 5.83 5.27 6.52 6.18 
Other scales 
8. Near-far 2.71 3.36 3.89 5.28 5.22 6.87 5.50 6.85 6.18 
9. Active-passive 1.58 2.11 2.79 3.48 3.70 6.62 5.00 5.14 5.45 
10. Simple-difficult 4.73 5.43 5/7 5.96 6.00 6.87 5.66 6.71 6,73 
11. Real-imaginary 141 1.80 2.58 3.92 3.90 6.62 5.33 6.71 6.00 
12. Possible-impossible* 1.51 1.63 2.16 3.28 4.50 6.75 6.52 5.28 6.36 
13. Predictable- 
unpredictable* 1.90 2.65 2.84 4.08 4.75 6.50 5.83 5.43 5.64 


* With the exception of the last two scales, which were added by the author because of their relevance to the concept in 
question, all other scales were taken from factor analyses on a variety of scales for the Arabic language and reported by 
Osgood, Archer, and Miron (4). 
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are, instead, either for or against Arab unity [see Table 1]. In other words, 
more than half of our moderate subjects should have been classified, in the 
light of this additional information provided by the use of four evaluative 
scales of the semantic-differential technique, as either extreme pro-Arab unity 
or extreme anti-Arab unity subjects rather than moderate subjects. 

Furthermore, as shown in Table 1, about 53 per cent of our so-called 
moderate subjects have mean potency ratings similar to those of extreme anti- 
Arab unity subjects, indicating their feelings that Arab unity is to them 
“quite weak, emotional, and slow.” In addition, moderates have similar mean 
ratings as anti-Arab unity subjects on such scales as "near—far," “simple— 
difficult,” and “predictable—unpredictable,” Thus, on the basis of these 
findings on our so-called moderate subjects, it is clear that the meaning of 
“Arab unity” to them has a lot in common with what this concept means to 
extreme anti-Arab unity subjects, 

Finally, this study was also intended to throw more light on the meaning 
of "Arab unity" to subjects with extreme positive as well as negative stands 
towards the issue. Thus, as shown in Table 2, the sample was divided into 
Seven groups of subjects, depending on the rating they gave for the concept 
“Arab unity” on the seven-point good-bad scale. For example, there were 
151 subjects who gave a rating of 1.00 on the good-bad scale, indicating their 
feeling that Arab unity is “very good”; while, on the other hand, there were 
22 subjects who gave a rating of 7.00 on the same scale, indicating their 
feeling that Arab unity is “very bad.” Our results indicate that those subjects 
who evaluate Arab unity "quite" or “very” highly (i.e. subjects with an ex- 
treme positive stand or attitude towards Arab unity) do not seem to have 
Tatings of equal magnitudes on scales representing dimensions or factors 
other than the evaluative factor. As Table 2 shows, these same pro-Arab 
unity subjects feel that Arab unity is either slightly potent or somewhere near 
the point of neutrality as far as potency is concerned. Furthermore, they also 
feel that Arab unity is "quite or slightly active, real, possible, and predictable” 
but also “slightly far and quite or slightly difficult” to achieve [see Table 2]. 

As to subjects with an extreme negative stand toward the Arab unity issue 
(i.e, those having mean evaluative ratings of about 6.00 or above and thus 
indicating that Arab unity is to them "quite bad" or "very bad"), these sub- 
Jects appear to have ratings of equal magnitudes on scales representing factors 
other than the evaluative factor. Thus, these same anti-Arab unity subjects 
also feel that Arab unity is “quite or very low in potency” as well as “quite 
От very far, passive, difficult, imaginary, impossible, and unpredictable” [see 
Table 2]. In the case of so-called moderate subjects (i.e., those having mean 


ТАВГЕ 2 
Mean Соор-Вар RATINGS or тне CONCEPT “ARAB UNITY” on 13 SEMANTIC-DIFFERENTIAL SCALES 
Good: Bad: 
very quite slightly equally slightly quite very 
Scale (N=151) (V=56) (N=7) (N = 12) (N22) (у=?) (N=22) 
Evaluative scales 
1. Good-bad 1.00 2.00 3.00 4.00 5.00 6.00 7.00 
2. Beneficial-injurious 1.05 2.00 3.14 3.50 4.50 5.57 6.59 
3. Safe-dangerous 1.56 2.87 3.00 4.25 5.50 6.00 6.36 
4. Great-contemptible 1.35 245 2.57 3.67 3.00 +71 6.6+ 
Меап 1.24 2.33 2.93 3.85 4.50 5.57 6.65 
Potency scales 
5. Strong-weak 1,48 2.68 2.86 3.50 6.50 6.00 6.59 
6. Rational-emotional 3.02 4.38 4.86 5.83 5.50 5.86 6.18 
7. Fast-slow 4.10 5.23 5.14 5.67 6.50 4.71 6.86 
Mean 2.87 4.10 4.29 5.00 6.17 5.52 6.54 
Other scales 
8. Near-far 2.93 4.71 4.14 4.67 6.50 6.57 6.59 
9. Active-passive 1.73 3.14 3.00 3.83 4.00 6.57 5.73 
10. Simple-difficult 5.05 5.59 5.43 6.17 7.00 6.28 6.59 
11. Real-imaginary 1.56 2.96 3.43 447 6.00 6.14 6.50 
12. Possible-impossible* 1.54 2.78 3.29 442 4.50 6.14 6.77 
13. Predictable- 
unpredictable* 2.16 3.55 3.86 4.58 4.50 6.00 6.09 
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evaluative ratings of about 4.00 and thus feeling that Arab unity is about 
“equally good and bad”), it appears that they judge “Arab unity” on the 
Potency factor in a manner similar to that shown by the anti-Arab unity sub- 
jects, as indicated by their mean potency rating of 5.00 across all three potency 
scales [see Table 2]. Also, moderate subjects on the evaluative factor judge 
“Arab unity,” similar to anti-Arab unity subjects, as being “slightly far, 
impossible and unpredictable” as well as “quite difficult” to achieve. 


D. Discussion 


The results of this study demonstrated that the use of various semantic- 
differential scales (representative of the evaluative, potency, and other factors 
of meaning), in contrast to the use of scales representative of the evaluative 
factor alone, results in a more-complete picture about the attitude object and 
does help to improve prediction. As stated by Osgood eż al. (5, p. 198), “two 
People may have identical attitudes toward a concept (as determined by al- 
location to the evaluative dimension alone), and yet have quite different 
Meanings of the concept (as determined by the profiles as wholes).” 

The results presented in Table 1 show a comparison of the information 
that can be obtained from using scales representative of various factors of 
meaning to that which was obtained on the same subjects in an earlier study, 
using the Sherif-Hovland technique (1). About 62 per cent of the subjects 
that were considered previously as moderates (at positions, D, E, and F) on 
the evaluative dimension turned out, as shown by their mean evaluative rat- 
ings [see Table 1], to be either strongly for or strongly against Arab unity 
rather than moderates. Thus, if the majority of so-called moderates have 
extreme rather than neutral stands, as revealed by the foregoing results, then 
it is reasonable that they would reject more statements than they accept (1). 
It may very well be that for our Arab subjects a truly neutral or moderate 
Stand is impossible to find. In other words, it may mean that those subjects 
who consider themselves to be moderates are not really expressing their true 
feelings, and that the use of less-direct techniques of questioning may help to 
Teveal this tendency. | 

On one hand, the Sherif-Hovland technique makes use of a series of sen- 
tences or statements (A to I) that vary all the way from “the achievement of 
Arab unity is absolutely essential from all angles in the Arab people’s in- 
terests” to the extreme opposite of this statement. And, among other things, 
the subject is simply asked to indicate the statement "most acceptable" to 
him. On the basis of classifying subjects in terms of statements "most ac- 
ceptable” to them, our moderate subjects were those checking Positions B, 
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E, and F. On the other hand, the semantic-differential technique uses scales 
of adjectival opposites instead of sentences or statements and such usage may 
result in less conscious control on the part of the subject and a greater readi- 
ness to respond with his true feeling about the concept. Thus, our results 
have shown that moderate subjects at Position D and at Position F (as de- 
termined by the most-acceptable statement to them from a series of nine 
statements—A to I) were, respectively, pro-Arab unity and anti-Arab unity 
(as measured by their mean ratings on four evaluative scales of the semantic 
differential), rather than moderates. In other words, it seems that the use of 
scales of adjectival opposites results in a more-correct estimate of one’s stand 
than when such stand is determined by the sentence or the statement “most 
acceptable" to the individual. 


Not only did the so-called moderate subjects reject significantly more state- 
ments than they accepted on the evaluative dimension (1), but they also 
showed selectivity in mass-communication media (i.e, newspapers and radio 
stations) similar to that shown by anti-Arab unity subjects (2). Again, such 
similarity to anti-Arab unity subjects compels us to doubt the neutrality of 
these so-called moderates, as determined by the statement "most acceptable" 
to them. Additional information confirming our doubts is provided by the 
general view that these subjects have of Arab unity, as indicated by their 
mean ratings on a number of scales representing potency and other factors 
of meaning [see Table 1]. Thus, our results have shown that these subjects 
view Arab unity as "quite weak, emotional, slow, far, and difficult." In short, 
it seems that the meaning of the concept “Arab unity" to so-called moderate 
subjects (at Positions D, E, and F) is similar in a number of respects to the 
meaning that the concept has to anti-Arab unity subjects (at Positions G, H, 
and I). 

Finally, subjects were classified in terms of the rating they gave for "Arab 
unity" on the good-bad scale, as representative of the evaluative factor, in 
order to find out the differences in meaning of the concept for various groups 
of subjects having different attitudes. "This classification of our sample was 
followed because we are interested in finding out if differences in evaluation 
of the attitude object are necessarily associated with similar systematic differ- 
ences in the other factors of meaning. Furthermore, this procedure was fol- 
lowed because (on the one hand) we have here in the concept of "Arab unity" 
a controversial issue that is highly ego involving for our subjects and (on the 
other hand) because the evaluative factor accounts for approximately 54 per 
cent of the total variance (4). Thus, as may be seen from Table 2, our re- 
sults show that subjects who evaluated "Arab unity" favorably, as shown by 
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their mean evaluative ratings, had near-neutral mean potency ratings and 
felt also that Arab unity is "slightly far as well as quite or slightly difficult." 
In other words, the same subjects who feel that Arab unity is “very good” or 
“quite good” also feel that such unity is “not so near,” “quite difficult,” but 
“quite or slightly active, real, possible, and predictable.” This Picture that 
subjects with an extreme-positive stand towards Arab unity have could not 
have been known or predicted from their positive evaluative stand alone. 
Thus, the use of scales representative of factors other than the evaluative 
factor alone, enabled us to get a much more-complete picture about the mean- 
ing of the concept of Arab unity to our extreme pro-Arab unity subjects. 
Now, what about our anti-Arab unity subjects? As shown in Table 2, our 
results show that subjects who evaluated “Arab unity” unfavorably, as 
shown by their mean evaluative ratings, also had mean ratings indicating low 
Potency, in addition to their other expressed feelings that Arab unity is “very 
far,” “quite or very passive,” “very difficult,” “very imaginary and impos- 
sible,” as well as “quite unpredictable.” It seems that subjects who feel that 
Arab unity is “very bad” or “quite bad” have a tendency, regardless of which 
Scale of adjectival opposites is presented to them, to check the apparently” 
unfavorable end of the scale as best representative of their feeling. Thus, be- 
cause Arab unity is “very bad,” it is also “very passive, difficult, imaginary, 
impossible, and even unpredictable.” One may gonca here the contradictions 
involved in believing that Arab unity is “impossible” but at the same time 
“unpredictable.” Also, similar inconsistencies appear in their feelings that 
Arab unity is “weak, passive, and imaginary,” yet at the same time injurious 
and dangerous.” All kinds of explanations can be offered here to account for 
these contradictions and inconsistencies; but, at the same time, it must be 
pointed out that from the point of view of the anti-Arab unity subjects 
themselyes these contradictions are perhaps neither recognized nor admitted 
as such. If Arab unity is “imaginary and impossible,” then one should he 
able to “predict” that it will not materialize; however, being predictable 
may have had a “favorable” flavor about it, thus implying actualization of 
unity and, therefore, making our anti-Arab ШУ subjects feel that Arab 
unity is “unpredictable.” That Arab unity is ‘dangerous and injurious 0 
our anti-Arab unity subjects throws further light on our previous finding 
showing that these subjects, much more so than pro-Arab unity subjects, 
Were highly selective in the kinds of mass-communication | media to which 
they expose themselves, the selectivity being in the direction I media ex- 
Pressing their own points of view about the Issue:in question (2). It is only 
Teasonable to explain this "avoidance" behavior of anti-Arab unity subjects 
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as related to their conception of Arab unity as “dangerous and injurious.” 
Thus, on the whole, it appears that those subjects who evaluate Arab unity 
unfavorably tend also to pick the "apparently" unfavorable end of any scale 
(be it representative of potency, activity, or other factors) as best expressing 
their feeling. 

As to subjects with neutral mean evaluative ratings, our results show that 
they judge Arab unity as "quite slow and emotional” and, in a manner similar 
to anti-Arab unity subjects, "slightly far, impossible, and unpredictable" as 
well as "quite difficult” [see Table 2]. Again, the tendency on the part of 
subjects who evaluate Arab unity as "equally good and bad" to view Arab 
unity in terms similar to anti-Arab unity subjects gives further explanation 
to the similarity in selectivity of mass-communication media previously be- 
tween moderate and anti-Arab unity subjects (2). 

In concluding this discussion, it seems to the author that accuracy in meas- 
urement and consequently in prediction can be considerably improved if a 
combination of the Sherif-Hovland technique (7) and the Osgood semantic- 
differential procedure (5) is used. On one hand, the Sherif-Hovland tech- 
nique provides not simply a "point" or "score" but a "latitude of acceptance" 
and a "latitude of rejection" on any continuum or scale, thus increasing our 
information about the subject's stand on an issue and reflecting more ac- 
curately the realities of life situations. On the other hand, the semantic- 
differential procedure affords richer information about the meaning of the 
attitude object through the use of scales found to be representative of evalua- 
tion, potency, activity, and other factors. In addition, the semantic-differential 
technique is a simple "procedure in which a single stimulus is judged suc- 
cessively against a series of different scales" (3, p. 26) representative of the 
various independent factors of meaning. 

About six scales representative of the various independent factors of mean- 
ing (i.e., evaluation, potency, etc.) and appropriate to the concept in question 
are presented to the subjects in the regular semantic-differential rating form. 
However, for each of the scales, subjects are asked to indicate the position 
“most acceptable” as well as other positions “also acceptable” to them. The 
individual’s score on each scale would be the mean value of his latitude of 
acceptance: 1.е., his mean acceptable position. This modification should give 
us more information about the meaning of a concept than either of the above 
techniques alone and consequently should help improve prediction consider- 
ably. Such a procedure is worth investigating more thoroughly in future 


research. 
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E. Summary 


The purpose of the present study was to use the semantic-differential tech- 
nique in an attempt to throw more light on the meaning of “Arab unity” to 
subjects with various stands and to clarify some of the unexpected findings 
reached in earlier studies (1, 2). 

The same sample used in earlier studies (1, 2), consisting of 260 under- 
graduate Arab students at the American University of Beirut, was utilized 
in this study. In addition to information obtained previously (1, 2), the 
subjects were given a series of 13 different scales against each of which the 
concept of “Arab unity” was to be judged successively, according to the 
semantic-differential procedure. 

In general, the results of this study show that additional information ob- 
tained from scales representative of evaluative as well as other factors on 
the meaning of the concept “Arab unity” help to clarify some of the unex- 
Pected findings of previous studies (1, 2). Thus, subjects considered pre- 
viously to be moderates turned out to be mostly for or against Arab unity, 
as shown by their mean evaluative ratings. Consequently, the behavior of so- 
called moderates in rejecting significantly more statements than they accept 
(1) and in exhibiting selectivity in mass-communication media similar to anti- 
Arab unity subjects (2) are explained more fully in the light of additional 
information obtained from the use of the semantic-differential technique. Also, 
the meaning of “Arab unity” to pro-Arab unity and anti-Arab unity subjects 
is more fully understood as a result of information provided by results ob- 
tained from the semantic differential. А : 

Finally, a modified technique is suggested by the author in which scales 
representative of the total semantic differential are presented to the subject 
and, in line with the Sherif-Hovland technique, latitudes of acceptance are ob- 
tained on each of the semantic-differential scales. Such a procedure appears to 
help in improving prediction and is worth further investigation. 
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EXPERIMENTAL SOCIAL PSYCHOLOGY AND 
ITS NEIGHBORS* 1 


Department of Psychology, University of Washington 


Ezra STOTLAND 


A. INTRODUCTION 


Frequently social psychologists find themselves the targets of a variety 
of semirhetorical questions: How can there be a valid scientific theory in social 
psychology? How can there be any scientific laws in social psychology? and 
so forth. Sometimes social psychologists attempt to answer these questions, 
Sometimes not. Often, social psychologists ignore the questions in public; 
but, when the critics have gone home, they think about them. This paper is 
the result of one social psychologist’s attempt to come to terms with a few 
questions like those posed. Three questions are considered. Each concerns the 
relationship between social psychology (especially laboratory social psychol- 
ogy) and one of its neighbors: other types of experimental psychology, common 
Sense, and field studies. 


B. QUESTIONS 


1. Can There Be a "Social" Psychology Separate from or 
Independent of the Rest of Psychology? 


Can social psychology be "reduced" or eliminated by being absorbed into 
the rest of psychology? This question can be answered only by making an 
arbitrary but not meaningless distinction between two levels of discourse 
in social psychology (with full recognition of the existence of many shades of 
grey between the two levels). One level concerns the individual reacting to 
a situation that includes other people or symbols of other people. Any laws 
that are developed for this level are laws in which the individual's reaction 
is the dependent variable. Let us call such laws individual-social laws. At 
another level, dependent variables concern the relationships between or among 
Persons: e.g., leadership, a role structure, a group process, etc. Let us call 
these laws group laws. 

Individual-social laws bear a close family resemblance to the laws in general 
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psychology. They can and have been translated into simple S-R laws, laws of 
perception, etc. Our question is, “Are individual-social laws different in any 
fundamental way from other laws in psychology ?” 

My answer is that they are not different in a fundamental way, but that 
the values of certain parameters in the individual-social laws have to be con- 
sidered in comparison with the values of those parameters in the laws of 
general psychology. The general laws and the individual-social laws may 
vary systematically with variations in the values of the parameters. For ex- 
ample, suppose a parameter to be ambiguity of the stimulus situation. In 
general, but not always, social situations are more ambiguous than nonsocial 
situations. It may be that certain laws relating motivation to perception take 
one form in clearly defined situations and another form in more ambiguously 
defined ones. If so, one would expect that the application of these laws to 
social situations would necessitate consideration of the generally greater 
ambiguity of social situations. 

My argument is not for the reduction of social-individual laws to general 
laws; it is for the "reduction" of the differences between a social-individual 
hypothesis and what usually is called a general psychological hypothesis to 
a number of differences in parameter values (e.g., ambiguity, type of moti- 
vation, degree of similarity of subject to the source of stimulation) and for 
a systematic consideration of whether or not the differences їп parameter 
values lead to differences in the laws, 

The foregoing approach has several advantages. First, it implies study of 
variations along cach parameter while holding the type of law constant. Not 
all social situations are ambiguous; not all nonsocial situations are clearly 
defined. Social situations would be varied in ambiguity; so would the non- 
social. If the universe is as uniform as generally it is thought to be, the varia- 
ations in the degree of ambiguity should have the same effect on the form of 
the laws in individual-social conditions as in the nonsocial conditions. Thus, 
the first potential advantage is parsimony, 

Second, the second-order law relating the parameter value to the first-order 
law would receive increased empirical support from the data coming from 
the social and nonsocial realms; thus the confidence that could be placed in 
the law within each realm would be increased by the knowledge that it 
derives support in more than one realm. 

Third, if variations of parameter values in individual-social and nonsocial 
realms lead to variations in the form of the laws, greater precision of pre- 
diction can be attained. Because social situations or nonsocial situations are 
not equally ambiguous or equally well structured, consideration of the degree 
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of ambiguity within the social or within the nonsocial realms should lead to 
more exact predictions than otherwise, Attempts to place the individual-social 
laws and nonsocial laws in the same rubric can have heuristic value, suggesting 
new parameters in both areas or suggesting new types of laws. It is important 
to note that, in this way, the general psychologist derives useful suggestions 
from his social-psychologist neighbor. 

Fourth, if the relationship between parameter values and the form of the 
laws should prove as uniform as suggested, any explanations of the laws 
should apply equally well to the social and nonsocial situations, By explanation 
of the laws, I mean those mediating processes (within the individual or in 
his history) that are assumed to be necessary for the lawful relationships 
to obtain. Again, greater parsimony is achieved. 

At the second level of law, the group law, there is no analogous type in 
general psychology. There is no way of saying meaningfully that, after a 
certain amount of cooperation between a person and a machine, a mutual 
attraction develops. In other words, the laws in general psychology cannot 
be treated in terms of variations on a parameter. These group laws, as state- 
ments of regular empirical relationships, cannot be special Cases of general 
laws that apply also to nonsocial situations. Despite the foregoing, the separa- 
tion on the level of empirical laws should not lead one to the conclusion that 
these laws are floating in scientific space with no anchors. The anchors become 
evident when an attempt is made to explain the laws. In this case, the expla- 
nation of the laws may consist of a set of individual-social laws or individual 


nonsocial laws. $ 
For example, we might attempt to explain why two or more people who 


have successfully cooperated as part of a group develop a mutual attraction : 
Le, that group cohesiveness rises. We might invoke some law, such as the 
following: If an individual is rewarded regularly by some entity (either 
Social or nonsocial) he seeks to be in the presence of that entity, This law 
is then applied to each relationship of one group member to another (ie, а 
Series of one-to-one relationships). A logical outcome of the application of the 
law to the series of relationships is the empirical law relating successful 
Cooperation to group cohesiveness. The group law is not eliminated. It is 
Tetained in its original form, but it has been explained and (through the 
*xplanation) has been related to the total body of psychological knowledge ; 
thus, indirectly, to all knowledge. 

This process of explaining laws can be attempted, of course, at all levels 
of law: the individual-social, the nonsocial. Laws might be explained by 


Cognitive processes, by neural processes, etc. The process does not explain 
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away the laws. The laws remain, neither as simply arbitrary facts of nature 
nor as “regularly occurring accidents,” but remain as part of a general body 
of laws. 

But why retain the laws if they can be explained in principle or in fact? 
There are a variety of reasons. First, the exact theoretical formulation or 
the empirical basis of the explanation of the laws may not have been attained. 
To say that such explanations are possible in principle is not sufficient with 
regard to any particular law. It is also necessary to point out how the expla- 
nation applies (cf. 4). 

Another reason for retaining the laws applies after the explanations have 
been discovered. This reason is that the derivation of the law from its expla- 
nation may be so complex that the law is a useful shorthand. It is impractical 
and pointless to go through the process of logically deriving the law relating 
successful cooperation and mutual attraction over and over again for each 
case to which the law applies. The issue is more important if the law is to 
be used as the basis of explaining laws at a still higher level of abstraction. 
Suppose that a researcher is interested in developing a law concerning the 
stability of large institutions, such as factories or other firms. He may need 
to assume the validity of the group laws in attempting to explain laws at 
the institutional level. The researcher cannot start down the trail of reduc- 
tionism that leads to abandoning his institutional laws and to formulating 
everything in terms of subatomic particles. No nonnuclear scientist, no gen- 
eral psychologist would have а raison-d’ etre. 


2. How Does Social Psychology Go Beyond Common Sense? 


Unfortunately, there is much evidence for the accusation implicit in the 
question. Social psychologists have tested experimentally theories such as 
“a person will do something if it gets him something he wants and if it 
doesn’t cost too much”; “if someone agrees with you, you like him more"; 
and "if you frustrate someone, he will become angry." In each case, as in 
all cases of common-sense propositions, no amount of laboratory testing will 
refute the fact that at least one and probably many instances consistent with 
the proposition can be found in the real-life experience of social psychologists, 
other psychologists, and just plain people. If the laboratory test does not 
support one of the propositions, it is the test that frequently is rejected, not 
the proposition. 'The test can be rejected on grounds of artificiality, super- 
ficiality, erroneous assumptions, or (for the "know-nothing") on grounds of 
“зо what?" Because so many assumptions are involved even in the simplest 
of experiments, it is just as easy to reject the assumptions and the test as 
it is to reject the hypothesis. 
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In defense of social psychologists, one often argues that one man’s common 
sense is another man’s puzzle. But this fact does not contradict the point that, 
in some cases, there can be no question that the propositions or hypotheses are 
of a common-sense nature. 

The germ of a defense for social psychology emerges when contradictory 
common-sense propositions are placed side by side. The scientific question, 
however, should not be formulated in terms of which proposition is true 
and which is not. As indicated, from real life one can find instances consistent 
with both propositions. The scientific question is, “Under what conditions is 
One proposition true; under what conditions is the other true?” The most 
useful experimental design is one in which those conditions are determined. 
Often one requires an experimental design in which both propositions are 
tested under conditions varied systematically. If only one proposition is tested, 
all one can conclude is that under certain specified conditions the proposition 
tested is valid. One can say nothing about the truth or falsity of the other 
Proposition. The scientific question can be answered only if the sufficient con- 
ditions for each of the propositions have been ascertained. 

The approach of finding the sufficient and, in some cases, the necessary 
Conditions for a proposition to be true does not apply only to cases of contra- 
dictory common-sense propositions. It can be made to apply to some propo- 
sitions that do not appear to have any commonly believed contradictions. For 
example, the author does not know of any common American proverb that 
contradicts the idea that "we like people who help us," yet question can be 
raised as to whether this proposition is universal. A researcher might ask 
"Under what conditions is it false?" Perhaps the perceived motivations of 
the helper are relevant. 

The same approach can be applied to common-sense propositions, such as 
maximizing gains and minimizing losses, provided some independent criterion 
is established for “gains” and “losses.” “Under what conditions do people not 
maximize gains and minimize losses, even when they clearly and accurately 
Perceive the consequences of their actions?” Dostoevsky contended that people 
do not and should not always act in such a rational way. 

There is another way of handling the common-sense argument. In real life, 
the sufficient conditions for a process often are intertwined with nonrelevant 
Conditions to such an extent that it is impossible to separate one from the 
other. For example, if the proposition under scrutiny is that "we like some- 
Опе who agrees with us," what are the sufficient conditions for liking to 
occur? Is it sufficient that we are aware of the fact that the person agrees? 
Is it sufficient that he communicates this agreement to us? Is it sufficient that 
We observe him in the process of coming to agree with us? The researcher 
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must attempt to discover which one or more of these conditions are sufficient. 

‘The foregoing is counter to the advice sometimes given [see French (2) ] 
that we “‘start strong” in the laboratory, that we attempt to administer the 
independent variable by using a single induction that contains as many dif- 
ferent manifestations or aspects of the independent variable as possible: e.g. 
several dimensions of status. If the phenomenon has been observed in at 
least one instance in real life, no new knowledge is gained by observing the 
phenomenon again in the laboratory; and “starting strong” sometimes may 
accomplish only that. By "starting weak” (i.e., by inducing the independent 
variable in one precisely defined way), we may come closer to discovering the 
conditions sufficient for the dependent phenomena to occur. Of course, we run 
the risk of having bet on the wrong condition; but that risk is what makes 
research fascinating. 

The foregoing argument implies that if the hypothesis in the laboratory 
concerns a process that does not fit into common sense one may find it helpful 
to "start strong" at the beginning of a program of research. The first step 
in such a program is to establish that the phenomena does in fact occur—a 
finding that should surprise everyone but the researcher and some astute 
laymen. It is at this point that the laboratory is at its most productive and 
interesting. 


3. How Can We Generalize from Laboratory То Real Life? 


To answer this question, it is necessary to subdivide it into several smaller 
questions. First, “What sorts of real-life questions is it possible to answer in 
the laboratory?" There are at least two types of questions that cannot be 
resolved in the laboratory. One often has been pointed out. It is neither 
ethical nor practical to deal with certain variables in the laboratory. Such 
variables are suicide, extremely stressful conditions, and extreme aggression. 
Sometimes it is possible in the laboratory to deal usefully with watered-down 
versions of the foregoing variables, but one should not mistake a watered- 
down version for the "real thing." Factors that function in the laboratory 
might not function in the real-life condition. One must be careful in making 
extrapolations from the laboratory to real-life situations or, for that matter, 


vice versa. 

А second type of question that cannot be resolved definitively in the labora- 
tory is a chicken-and-egg puzzle that often grows out of field research or even 
correlational survey research, although some researchers [e.g., Festinger (1)] 
have stated that the laboratory can resolve the puzzle. A question often asked 
is, "Does A cause B or does B cause A?" 'This type of question cannot be 
resolved definitively in the laboratory. 
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In the first place, because of the way the question is sometimes formulated 
there is no reason ло? to believe that both alternatives are true: i.e., that A 
and B influence one another. Of course, in many cases, such mutual influence 
is simply not possible; but, in the cases for which the question does arise, it 
is evident that both directions of causation are considered possible. For mutual 
influence to occur, there must be some time lag in the effects. If A caused B 
instantaneously, and B caused A instantaneously, A and B would fuse into 
a sort of steady-state unity that could be considered as one entity. For most 
social-psychological phenomena there is a time lag. 

Let us assume that it is possible for A to cause B and for B to cause A. 
If, in the laboratory, we demonstrate that A causes B, that fact does not con- 
tradict the proposition that B causes A. At this point, the laboratory has ac- 
complished little. Furthermore, if it is found that A does not cause B in the 
laboratory, it still might do so in real life. The finding in the laboratory may 
be interesting, but it is ambiguous because there are many factors that might 
account for the failure. If the question is reformulated as, "Does A cause В 
more than B causes A?” the answer is to be found in the field situation and 
not in the laboratory, because the factors enhancing one direction of causation 
rather than the other might be present only in the real-life situation. 

What, then, can the laboratory accomplish in chicken-and-egg problems? 
The answers basically are the same as those given to the questions about com- 
mon sense. When there is serious doubt as to whether A leads to B or B leads 
to A or both, the laboratory can establish that, under certain circumstances, 
One or both of the sequences occurs. This demonstration is the laboratory's 
Most useful function. 

Note that the negative does not apply. The laboratory cannot prove that 
a given sequence does not occur in real life, but when there is no doubt about 
the occurrence of a sequence in real life, the laboratory might provide evi- 
dence with regard to the conditions sufficient for one of the sequences. 

The essence of the argument is that neither the laboratory findings nor 
the field studies can refute one another. If the results are inconsistent, the 
Problem is to discover what parameters account for the difference. After all, 
behavior in the laboratory is just as real as behavior anywhere else. The prob- 
lem is to interpret theoretically the findings—not to have them clash [cf. 
Hovland (3)]. 

What are the bases for generalization from the laboratory to real life? 
Of Course, there are the useful warnings about populations of people and 
Of situations. However, such warnings are of no use unless we know some 
Way of taking them into consideration. A first approximation to an answer 
derives from an examination of the theoretical variables that are assumed 
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to have been actualized in the laboratory. Presumably, we should be able 
to generalize to other instances of these theoretical variables. 

Parallel statements can be made for the population of subjects. The rub 
lies, however, in the fact that any concrete event or sequence can be con- 
strued to be an instance of an almost infinite variety of abstractions. The 
question becomes, “What grounds can the researcher have for interpreting 
his experiment in the way that he does?” The answer lies in either one or 
both of two directions. First, the researcher may have data that can be 
construed as reflecting other instances of the same phenomena. These instances 
may be the same phenomena or the same type of phenomena. For example, 
there may be instances of leadership roles showing a given effect on the role- 
player or instances of roles other than leadership roles having the same effect. 
The rub in this type of answer is the excessive ease and simplicity of the 
construing. However, if enough instances can be cited the easiest and simplest 
construction should become apparent. These instances should, of course, come 
from as wide a variety of laboratory and field studies as possible. (This type 
of argument also applies to the relationships among field studies.) 

In some instances, the real-life "replication" of a laboratory situation may 
not have occurred anywhere in society or may not have been recorded; yet 
social changes and continued studies in the obscure corners of human society 
may produce these “replications.” The laboratory findings, then, would have 
prepared the way for an understanding of newly occurring or newly dis- 
covered social-psychological processes. It would be a poor science, indeed, that 
did not have some basis on which to make predictions about new situations 
before they occurred. For example, social psychology should be prepared 
to say something now useful in a world society in which all-out war does 
not occur. 

The second type of answer to the question of generalizing from laboratory 
to field concerns the explanation for the causal sequence or law found in 
one or more studies. By explanation, the author means an account of the 
process postulated to mediate between cause and effect. The mediational pro- 
cess might involve a level of data different from that for the cause and the 
effect. This process can occur within the person in a form governed by indi- 
vidual-social laws or it can occur among a number of people (when a group 
law is involved). It should be obvious that the law would apply to cases 
for which there is independent ground for assuming that the mediational 
process could and does occur. For example, suppos the mediational process 
explaining the law is one that entails a person’s seeking to perceive the world 
as consistently as possible. It would be expected that the laws could be gen- 
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eralized to any population for which there existed independent grounds for 
believing that there was motivation to achieve consistent perceptions. 

The independent grounds can take a variety of forms: independent test 
results, knowledge of the cultural norms, knowledge of the early learning 
experience, or knowledge of their psychological structure. The original law 
would be strengthened, not weakened, if one found that the law did not hold 
in populations for which there was little or no reason to believe that the 
assumed mediating process was operative. The law would be strengthened 
in the sense that its explanation would have been independently supported. In 
short, the key to sound generalization from the laboratory to real life lies 
in the explanation of the laws based on laboratory findings. 

This approach suggests also that, before predictions are made on the basis 
of laboratory findings, the researcher must ask himself not only what popu- 
lation of persons and of situations he believes he has sampled; but also ask 
himself what explanation he was using and what grounds exist for believing 
that the explanation holds in cases outside the laboratory. These grounds can 
come from levels of analysis and of scientific discourse different from the 
one at which the researcher works. For example, he might need to be con- 
cerned with economic factors that change child-rearing practices and cultural 
norms in such manner as to reduce the probability that the necessary mediating 


conditions will occur. 
C. ConcLusION 


The general conclusion with regard to the three questions posed in this 
Paper is that social psychology in its experimental form can make a unique 
contribution to the total scientific enterprise provided some thought is given 


to its role with regard to its neighbors. 
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ADOLESCENT RESPONSE TO RELIGIOUS APPEAL 
AS RELATED TO IPAT ANXIETY* 


Dallas Baptist Association and Houston Independent School District** 


C. Ewixc CooLEY AND Jerry B. HUTTON 


A. PROBLEM 


Augustine has been described as having been a very anxious person. How- 
ever, it has been noted that “Опе of the most remarkable effects of his con- 
version and subsequent career was a marked reduction of anxiety symptoms 
(2)." The area in religion referred to as “conversion,” committal, or rededi- 
cation is a fruitful but often inaccessible area for research. 

The current study is concerned with two questions: (а) Are individuals 
with a high anxiety level more apt to respond to the emotional appeals of 
religious invitations? (5) Will individuals who respond to emotional appeals 
in a religious setting have a decrease in anxiety level? 


B. METHOD 


Seventy-two of 255 adolescents attending a Southern Baptist youth camp! 
responded to a religious appeal by moving to the front of the tabernacle fol- 
lowing a series of preaching services and during a special invitation period. 
The campers were from Houston and surrounding suburbs. They represented 
Practically every section of town and ranged from lower to higher socio- 
economic status. The 72 adolescents who made a response were responding 
to a call for conversion, rededication, or special-service. Most of the decisions 
were for rededication. Both the total number of campers and the number 
making decisions closely approximated +3 per cent males and 57 per cent fe- 
males. Table 1 compares all campers with those who made response. 

On the first day of camp, during an afternoon rest period, the IPAT Anx- 
iety Scale Questionnaire (Self-Analysis Form) was given to all student camp- 
ers. Those who made responses to appeals throughout the week were given 
the test again on the last day of camp, three days later. Comparisons were 
made between the test scores before and after the response. 

It was not practical to include a control group in this study as the purpose 
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TABLE 1 
AGE AND SEX COMPARISONS OF THE TOTAL NUMBER OF CAMPERS 
WITH Тноѕе WHO MADE RESPONSES 


Age 
Population and sex 13* 14 15 16 17 18 19 Total 
All campers 
Male 10 25 24 31 16 1 3 110 
Female 10 38 32 36 24 4 1 145 
Total 20 63 56 67 40 5 4 255 
Campers who responded 
Male 3 8 9 6 4 0 1 31 
Female 2 14 7 6 9 3 0 "m 
Total 5 22 16 12 13 3 il 72 


* Those who were 13 years of age were within three months of being 14. 


of the Harris County Baptist Association was to invest in testing primarily 
as an aid in screening and group work. Funds were not available to obtain 
pretests and posttests from a comparable group of adolescents who did not 
attend camp or from those in the camp who made no response. 


C. RzsuLTS 


Of the 255 campers, 55 had a sten rating of 8 or above indicating a high 
anxiety level. This involved 22 per cent of the total number of campers. 
Fourteen of the 72 who made responses had a sten rating of 8 or above, rep- 
resenting 19 per cent with high anxiety. Thus, it appears that those with high 
anxiety ratings did not tend to respond to religious appeals to a greater extent 
than did those with low anxiety ratings. 

The total number of campers had a sten mean of 6, while those who made 
responses had a sten mean of 6.2. Again, it appears that those who made re- 
sponses did not tend to have a significantly higher level of anxiety than the 
total number of campers. 

Based on the frequencies of males and females who increased, decreased, or 
remained the same in raw-score anxiety level following response to a religious 
appeal, chi squares were found to be 24.9 for males (p < .001), 5.4 for fe- 
males (? < .02), and 22.8 ( < .001) for all responders. 

The test-retest reliability for the three-day interval was .78. The pretest 
standard deviation was 9.7 (mean = 36) and the posttest standard deviation 
was 10.8 (mean = 31). A critical ratio of 6.1 shows that the mean on the 
posttest is significantly lower (indicating lessened anxiety) than the pretest 
mean. 

D. DiscussioN 

One can conclude that adolescents who respond to invitations for conver- 
sion, rededication, or committal to special service are not necessarily individ- 
uals with high levels of anxiety. Tt follows that the emotional appeal of a 
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religious service does not tend to cause some individuals with high anxiety 
levels to respond to a greater degree than those with other anxiety ratings. 
In other words, anxiety does not seem to be related to a response to a religious 
appeal. 

It should be noted that although it appears that anxiety was significantly 
lowered among those who responded, it is not known if the remaining camp- 
ers demonstrated changes in the level of anxiety. Perhaps, due to the total 
experience of the camp, all of the campers experienced a lessening of anx- 
iety level. A possible restudy is suggested that would give a second test to 
those who did not respond, and a first and second test to those who did not 
attend camp. A comparison could then be made between the three groups to 
see if the total camp experience or response to a religious appeal might be 
influencing factors. 

By observing the changes in the various areas of anxiety as subdivided by 
Cattell and Scheier (1), it appears that the lowering of anxiety occurs to the 
greatest extent in the areas of suspiciousness, paranoid insecurity, and guilt 
proneness. The decrease does not seem to occur as significantly in areas related 
to personality structure. Anxiety related to ego strength is not lowered signifi- 
cantly. Although the part scores are too brief to be of significance, it might 


be said that the reality of the situation has not changed. The individual still 


has to face the same problems in life. 
E. SuMMARY 


Seventy-two adolescents, among 255 attending a summer youth camp in a 
religious setting, responded to group invitations for rededication, conversion, 
or committal to special service. The total number of campers were given the 
IPAT Anxiety Scale Questionnaire on the first day of camp, and the 72 re- 
sponders received it the second time on the last day of camp. The comparison 
of pretests and posttests suggests that levels of anxiety are not significantly 
related to responses to religious appeals. Furthermore, it appears that those 
who responded to the appeals experienced a lessening of anxiety to a signifi- 


cant degree. 
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FAMILIARITY, DOGMATISM, AND REPORTED STUDENT 
ATTITUDES TOWARD THE DISABLED*? 


Rehabilitation-Education Center, University of Illinois 


Jack К. Genskow AND Frank D. MAGLIONE 


A. INTRODUCTION 


Phenomena related to attitudes of persons toward the disabled should, it 
would seem, be understandable and capable of description in terms of the 
psychology of attitudes in general. When given two milieux known to differ 
in a particular characteristic suspected as capable of affecting attitude expres- 
sion, and when introducing specific conditions also suspected as capable of 
affecting such expression, data obtained by means of devices purporting to 
measure expressed attitudes should “behave” in generally anticipative fashions. 
More specifically, one might expect differences in expressed attitudes toward 
the disabled between a campus population having considerable familiarity with 
the disabled and a second having little familiarity. Further, differences might 
ach population if the measuring devices were adminis- 
tered by a person in a wheelchair and by a person not in a wheelchair. It 
was believed that attitudinal data obtained under these varying conditions 
could be described in terms of more inclusive behavior characterizations. 

) suggests the existence of a tendency toward con- 
perception, affect, and behavior. Consistency 
d with regard to attitudes toward the 


be expected within е 


Equilibrium theory (4, 7 
sistency between cognition, 
among these elements would be expected \ i i 
disabled ; for example, an increase in positive or negative affect in the presence 


of a disabled individual would tend to be reflected in the cognitive behavior 
of a person completing an unsigned attitude questionnaire dealing with the 
disabled. Rosenberg eż al.’s concept of “threshold for inconsistency” (11) sug- 
gests that in some situations a person’s attitude (or reported attitude) would 
be more consistent with its cognitive, affective, and behavioral elements than 
in other situations. Thus, with a disabled individual present, a subject’s anon- 
ymously reported attitude might be more consistent with his perception, cog- 
nition, and affect than in a less reality-bound situation when no disabled in- 
dividual is present. Heider’s “balance theory of sentiments” suggests that 
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unfamiliarity, as well as perceived lack of similarity, between the perceived 
and the perceiver, tends to yield a negative attitude. Hebb (6) stated that an 
object (e.g., a disabled person) which is partly familiar—i.e., which presents 
stimuli that arouse “habitual processes of perception” but that also is, different 
in enough respects to elicit “incompatible processes”—tends to produce fear. 
Thus attitudes toward the disabled could be posited to depend in part upon 
the perceiver’s previous contacts or familiarity with the disabled, as well as 
upon perception of similarity—dissimilarity.2 

Rokeach (10) was concerned with the structure or organization of belief 
systems (e.g. tolerance of others) rather than with the specific content of 
the belief system. He differentiated between individuals who are relatively 
open (low dogmatic or low authoritarian) and individuals who are relatively 
closed (high dogmatic, high authoritarian) with respect to belief-disbelief sys- 
tems about reality. Rokeach suggested that the highly dogmatic person tends 
to see his environment as threatening and (as a result) is intolerant of others, 
including (the present authors assume) the physically disabled. 

In terms of such theoretical orientations, the following hypotheses were 
examined. 

1.. Two groups believed to differ in familiarity with the disabled would be 
expected to respond differently to a questionnaire containing items purporting 
to measure attitudes toward the disabled. Thus the group more familiar with 
the disabled would be expected to report a more positive attitude toward the 
disabled than would the group which was less familiar with them. 

2. Further differentiations between the groups could be made between 
questionnaire administrations by an ostensibly disabled person and an osten- 
sibly nondisabled person; i.e., with the “disabled” administrator present, the 
group more familiar with the disabled would react more positively toward 
the disabled; and the less familiar group would react more negatively toward 
them than they would when no “disabled” person was present. 

3. In view of the possible dynamic relationship between dogmatism and 
the expressed attitudes, it was hypothesized that items purporting to measure 
degree of dogmatism would show a relationship to attitudes—with high dog- 
matism being associated with negative attitudes expressed toward the disabled. 


B. METHOD 


The subjects were 111 students (approximately 60 of whom were female) 
in two educational-psychology classes at each of two state universities: Indiana 
and Illinois. The 57 students at the University of Illinois were juniors and 


2 See Wright (13) for a further discussion of Hebb’s and Heider’s theories 
applied to attitudes toward the disabled. 
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seniors who had the same instructor. One Indiana University class was com- 
posed of upperclassmen; the other class, with a different instructor, consisted 
of underclassmen. Because the University of Illinois has had an extensive pro- 
gram for physically disabled students for a number of years and because In- 
diana University does not have such a program, the Illinois subjects are desig- 
nated as the familiar group (F); the Indiana subjects, as the unfamiliar 
group (U). 

'T wo questionnaires were combined : the Attitude Toward Disabled Persons 
Scale (TDP), developed by the Human Resources Foundation (14), and 
Rokeach's Dogmatism Scale (DS), Form E (10). 'Two combined forms of 
40 items each were used. The ATDP-DS-1 was composed of all 20 ATDP 
items randomly combined with the odd-numbered items of the 40-item DS; 
the ATDP-DS-2 randomly combined the same 20 ATDP items with the re- 
maining 20 DS items. The same tape-recorded directions were used for all 
questionnaire administrations. 

Four mixed-sex classes were used, two at each university. Each class was 
first administered the ATDP-DS-1; and, after an interval of one or two 
weeks, the ATDP-DS-2. Each of the four able-bodied administrators gave 
the questionnaire twice to different classes: once as an able-bodied individual; 
once while posing as a “disabled” individual in a wheelchair. Table 1 shows 
the procedural design. 

After each instructor announced to his class that cooperation in some re- 
search was desired, he turned the class over to the administrator, who pro- 
ceeded to distribute the questionnaires from the front of the room. Then he 
turned on a tape recorder and played directions that were used for every 
administration. When all questionnaires were complete, the administrator col- 


TABLE 1 
PROCEDURAL DESIGN 
Group U* Group F** 
Administration Class A Class B Class C Class D 
First (ATDP-DS-1) PE Bi di i5 
Administrator Administrator Administrator Administrator 
One One Two Two 


Able-bodied Wheelchair ^ Able-bodied Wheelchair 


S -DS- NS = 
econd (ATDP-DS-2) Administrator Administrator Administrator Administrator 


Four Four Three Three 
Wheelchair Able-bodied Wheelchair Able-bodied 
A-2 B-2 C-2 D-2 


* Two weeks elapsed between the first and second test administrations, 
** One week elapsed between the first and second test administrations. 
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lected them and left the room. The entire procedure took less than the total 
class period of 50 minutes. 
The tape-recorded directions were as follows: 

We are using this tape recorder to standardize the administration of 
this questionnaire. In other words, each group to which this questionnaire 
is given will hear these same directions given the same way over the 
tape recorder. This questionnaire is the first of a two-part research 
design, the results of which will be used for research purposes only. The 
second part of the scale will be given to you at a later date. The ques- 
tionnaire is designed to measure your attitudes and opinions toward 
several different factors. 

Please do not put your name or other identifying marks on the question- 
naire; this study is interested in the results of the class as a whole rather 
than individual results. If any of you know, or have seen the person 
administering the test before, please raise your hand [pause]. It is be- 
lieved that the directions are self-explanatory. Please begin now and pro- 
ceed at your own pace, 


To analyze the data the 7 test for independent samples was used with both 
the ATDP and the DS on all comparisons between groups U and F (i.e., 
total group comparisons, intergroup wheelchair administration comparisons 
and intergroup able-bodied-administration comparisons), and also on the 
A'TDP test-retest comparisons within both groups under the two administra- 
tion conditions: i.e, wheelchaired vs. nonwheelchaired administrators. "The 
more powerful and appropriate ¢ test for matched samples could not be used 
in the test-retest situations because the individual questionnaires were anony- 
mous and unidentifiable for individual comparisons between administrations. 
Anonymity also accounted for the Мз being different for each administration. 
Relationships between the ATDP and DS (split-half) forms for each admin- 
istration were determined by means of Pearson r. 


C. RzsurTS 


Using tests for independent samples, it was found that the group of col- 
lege students more familiar with the physically disabled (F) reported a more 
positive attitude toward the disabled than did the group less familiar with 
them (U), the difference being significant beyond the .0005 level [see Table 
2]. The two groups were seen to differ significantly also on the dogmatism 
variable, Group F reporting a less dogmatic approach ( < 01). [See Та- 
bles 2 and 3.] 

Able-bodied-administration comparisons between Groups U and F on the 
A'TDP differ at the .15 level of significance; whereas between-group wheel- 


E 


Ж | — 


TABLE 2 
ATDP лхо DS TOTAL-RESPONSE COMPARISONS 
Group U Group F 
Instrument Ns Mean SD Ne Mean SD Difference t pen 
ATDP 109 80.01 11.84 115 86.39 12.15 6.38 3.643 .0005 
DS 109 7548 12.78 113 70.12 11.14 5.36 2.807 01 


* N is based on two questionnaire administrations to each group. 
** ATDP, one-tailed ¢ test; DS, two-tailed ż test. 
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TABLE 3 
INTERGROUP COMPARISONS OF WHEELCHAIR AND ABLE-BopieD TEST ADMINISTRATIONS 
Administration ATDP Ds 
Group U Group F Nw» t pt t P 
AB AB 107 1.196 15 2.014 .05 
wc wc 115 3.508 0005 2.410 02 
WC/AB WC/AB 222 3.643 -0005 2.807 01 


* ATDP, one-tailed £ test; DS, two-tailed £ test. 
** N is based on two questionnaire administrations to each group. 


chair administrations differ at the .0005 level [see Table 3]. These data are 
in accord with the second hypothesis, which is that the wheelchair-administra- 
tion means for the two groups would differ in opposite directions from their 
respective able-bodied-administration means. However, Table 4 indicates that 
—whereas Group U’s wheelchair-administration mean was lower (less favor- 
able) and Group F's was higher (more favorable) than their respective able- 
bodied-administration means—as predicted these differences are significant only 
at the .15 level. If the more appropriate and more powerful ¢ test for paired 
samples had been used, the results might have reached a more significant level, 
The hypothesis that wheelchaired and able-bodied administrators effect a 
change in the reported attitude must be rejected, 

No product-moment correlation coefficients approaching significance were 


found between measures of dogmatism and attitudes toward the disabled [see 
Table 5]. 


D. Discussion 


Heider's view that both the unfamiliar and the dissimilar yield negative 
reactions appears to be applicable to attitudes toward the disabled. For Group 
U, which has had little or no previous contact with the disabled, the disabled 
were probably both unfamiliar and dissimilar and thus perceived in a nega- 
tive light. Group F would be more familiar with the disabled, and the per- 
ceived similarity—even if only in terms of being fellow students on the same 
campus—would also increase between the able bodied and the disabled. This 
increased familiarity and similarity would partially account for the more fa- 
vorable attitude expressed by Group F. Another factor in Group F's more- 
positive reported attitude is the fact that the nature of contacts with the dis 
abled students at the University of Illinois is probably more positive than 
negative because the disabled students are rigorously screened prior to admit- 
tance to the University, and thus constitute a positively biased sample of the 
disabled population. 


INTRAGROUP COMPARISON OF WHEELCHAIR es ИИ TEST ADMINISTRATIONS OF THE ATDP 
Wheelchair Able-bodied 
Group Classes N Mean SD Classes N Mean SD Difference t $ 
U B1 + A2 57 78.69 12.69 А1 + B2 52 81.83 14.38 3.14 1.208 45 
F D1 + C2 58 87.68 13.20 C1 + D2 55 85.02 13.39 2.66 1.087 415 


ЯМОІТОУИ са 3NV3J GNV MONSNAD '3 NOV. 


SEE 
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TABLE 5 
MEANS, STANDARD DEVIATIONS, AND INTERSCALE CORRELATION COEFFICIENTS 
FOR EACH ADMINISTRATION OF ATDP-DS 


ATDP* Ds** 
Administration Class N Mean SD Mean SD ress 
Group U 
Able-bodied A-1 27 83.14 14.82 79.66 12.89 —.049 
Wheelchair A-2 26 79.15 12.84 76.03 11.41 —.028 
Wheelchair B-1 31 78.25 12.89 74.67 10.87 —.027 
Able-bodied B-2 25 80.40 13.86 71.52 14.29 —,047 
Group Е 
Able-bodied C-1 25 84.16 12.84 74.00 9.68 —.005 
Wheelchair C-2 26 87.03 10.30 68.57 11.41 —0.28 
Wheelchair D-1 32 88.84 9.96 70.43 10.26 —.037 
Able-bodied D- 30 85.66 14.26 68.00 12.55 —.015 


* A high score indicates favorable attitude toward the disabled. 
** High scores indicate high dogmatic. These are split-half forms of the DS. 
*** Pearson product-moment correlation coefficients between the ATDP and DS. 


Equilibrium theory would account for the difference in reported attitudes 
between Groups U and F in terms of past inconsistencies in the attitude struc- 
tures of Group F, these inconsistencies being brought about by favorable ex- 
perience with the disabled. According to Festinger, if the dissonance created 
by these inconsistencies became greater than the resistance to change, then the 
least resistant elements of cognition would be changed ; thus reducing the dis- 
sonance. Thus, whereas the precollege attitudes of Group U and Group F 
may well have been similar, inconsistencies in Group F were brought about 
by contact with disabled students which these results suggest may have re- 
sulted in a positive change in attitude more consistent with experience with 
the disabled. 

These results are in agreement with other relevant studies (14), which 
used the ATDP and which also indicated that reported attitudes toward the 
disabled were more favorable for groups having had previous contact with the 
disabled. More research is needed before the processes underlying this rela- 
tionship between familiarity and acceptance can be more fully explained. It 
should be noted that when discussing the difference in attitudes expressed by 
Groups U and F, it is not necessary to correct for the difference in dogmatism 
between the two groups because attitudes and dogmatism were seen to be 
essentially not related [see "Table 5]. 

Although the hypothesis that wheelchaired and able-bodied administrators 
would yield attitude responses in opposite directions for Groups U and F is 
tentatively rejected, the results were in the expected direction. Furthermore, 
the hypothesis was rejected using a 7 test for independent samples in a test- 
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retest situation; the more powerful ¢ test for matched samples possibly would 
have yielded a larger ¢ and a stronger confidence level (by taking into account 
the correlation between test and retest). Also, whereas the interschool differ- 
ence in reported attitudes was barely significant when the able-bodied admin- 
istrations of the ATDP were compared, when the interschool wheelchair ad- 
ministrations were compared the difference was found to be highly significant. 
Thus it appears that the presence of a “disabled” individual accentuated the 
difference in reported attitudes between Groups U and F to some degree, 


which was predicted. Granting that further research is needed on this hypoth- 


esis, the present findings merit further examination. 

As stated earlier, both Heider and Hebb suggest that the unfamiliar and 
the partly familiar, which the authors presume the disabled to be, are “fear 
inducing.” Thus, Group U, which has had little or no contact with the dis- 
abled, would be expected to experience a rise in affect in the presence of a 
physically disabled individual. Subjective confirmation of this increase in affect 
was supplied by a questionnaire administrator at Indiana University when he 
compared the rapport established with two classes when he was in a wheel- 
chair and when he was able-bodied. He characterized the wheelchair class as 


being “embarrassed, unwilling to look me in the eye." In terms of equilibrium 
theory, when the disabled individual was present with Group U and a re- 
sultant rise in negative affect occurred, the reported attitude under these 

nge in affect; i.e., the behavioral 


changed conditions would reflect this cha : h 
ted in the questionnaire would be- 


and cognitive attitude components as repor 
come more consistent with the perceptual and affective components. This 


would suggest that the tendency (at the .85 confidence level) for Group U 
to express a more negative attitude toward the disabled when the disabled 


person was actually present than when he was not present was a function of 


rise in negative affect. 
The wheelchair administrators representing ambiguous (or perhaps even 


stereotyped?) disabled persons would tend to be perceived largely as a func- 
tion of the perceiver’s own expectations based on past experiences. For Group 
F the disabled administrator would serve as a stimulus to "remind" them of 
prior contacts with disabled students on campus. 

Rosenberg et al. (11, р. 55) suggest that when “affective-cognitive incon- 
sistency is established in a context which forces the person to think closely 
about it [as when one is completing an attitude-toward-the-disabled question- 
naire in the presence of a disabled individual] then the intolerance for incon- 
sistency is maximized.” This theory posits that the cognitive reporting of an 
attitude would reflect the nature of the heightened affect. Group U would 
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report a more-negative attitude; Group F, a more positive attitude. Rosen- 
berg’s response threshold to inconsistency states that one can tolerate incon- 
sistencies in his attitude structure in some situations better than in others, For 
both groups.the presence of a wheelchair administrator might have raised any 
inconsistency in attitude above the response threshold, thereby producing a 
changed response. Conversely, when no disabled person was present, the situ- 
ation was less reality bound, and any inconsistencies in attitude were below 
the response threshold. This suggests that the reported attitude when the dis- 
abled administrator was present was actually more consistent with the groups’ 
“actual” attitudes toward the disabled than was the attitude reported when 
no disabled individual was present. 

The extremely low correlation between dogmatism and reported ,attitudes 
toward the disabled may be a function of the interaction of several different 
variables. In his studies, Rokeach tended to use only the top and bottom 
groups of scores (i.e, the extreme high and low dogmatics), whereas this 
study used all scores. Second, the use of a split-half version of the DS prob- 
ably affected the general reliability of the instrument and thus may have con- 
tributed to a lesser correlation. Third, Weitman (12) makes a distinction 
between various kinds of authoritarians and points out that some individuals 
have indifferent or ambivalent attitudes toward those items in the scale 
designed to measure dogmatism. I£ the sample chosen for this study was simi- 
larly nonego involved, it would contribute to the rejection of the dogmatism- 
attitude relationship; such lack of ego involvement, however, would not 
necessarily carry over to the items measuring attitudes toward the disabled be- 
cause of the apparent perception by the subjects of the test situation as one 
dealing with disabled persons. 

Bruner and Tajfel’s discussion (3) of Arnhoff's failure (1) to replicate an 
earlier study dealing with the tendency of ambivalent ethnocentric personali- 
ties to overgeneralize suggested that the dogmatic individual may either over- 
generalize or undergeneralize, depending upon situational factors as perceived 
at the time. This inconsistency in the results of previous studies therefore 
points up the possible existence of numerous unanalyzed variables, which in 
the present study may account for the lack of correlation between dogmatism 
and attitudes toward the disabled. The results of this study do agree, how- 
ever, with the correlation of .13 between the ATDP and the DS, based on 
a sample of 103 college students, reported by one of the authors of the 
AT DE? 


The following possible influencing factors should be recognized: (a) In 


3 Personal Communication from J. R. Block, July, 1962. 
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class discussions following each final administration, several members of one 
of the Group F classes pointed out that the wheelchair examiner was probably 
not disabled since his legs were not atrophied and his shoes showed signs of 
normal wear. Several other members of this same class observed that they 
detected “slowness” in wheelchair maneuvering, even though the administra- 
tors had been trained in the use of wheelchairs by rehabilitation personnel. 
Such comments reflect familiarity with fellow students who are members of 
the Illinois Student Rehabilitation Program. (2) One class at the University 
of Illinois included a student disabled with moderately severe cerebral palsy 
who did not use a wheelchair. This student was not present during the ques- 
tionnaire administration. (c) Comments were made by members of the two 
Group F classes to the effect that they perceived both questionnaires (ATDP- 
DS-1 and ATDP-DS-2) as being identical and as obviously concerned with 
disabled persons. (4) Since the ATDP and the DS were randomly combined, 
a potential interactive effect might be assumed. Thus the taking of the ATDP 
might have affected scores on the DS and vice versa. Since the combined ques- 
tionnaires were perceived by some subjects as “having something to do with 
the disabled” and since the same ATDP items were repeated on both forms 
of the combined scales (whereas split-half DS items were used), it might be 
presumed that the ATDP had a greater effect on the DS than vice versa. (e) 
at the DS is subject to an acquiescent-response set (8, 
9). (f) Except for the “disabled”-nondisabled variable, the personality and 
general stimulus value of the administrators were purposely kept neutral. It 
is suggested, therefore, that the wheelchair administrators were viewed as 
"stereotypes": i.e. a function of the perceiver's own viewpoint toward dis- 
abled persons (2). (9) Although all University of Illinois students see phys- 
ically disabled students nearly every day, the amount of contact varies for 
each student. This difference in contacts was not measured in this study. (л) 
Since contact with disabled students at Illinois is presumed. to be mostly posi- 
tive, a learning-theory approach to these results could possibly be used to ac- 
count for the more-positive reported attitude at Illinois. (i) Generalizations 
beyond this study presumably should be limited to comparable college sam- 


It has been suggested th 


ples. 
E. SUMMARY 


Student attitudes toward the disabled and dogmatism were investigated in 
two midwestern universities: one having had an extensive program for phys- 
ically disabled students for many years; the other having had no such program. 
On the basis of "equilibrium theory," Hebb's neurophysiological theory, and 


Rokeach's work on dogmatism three hypotheses were tested. Two classes at 
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each university were on two occasions administered different combined forms 
of the Attitude Toward Disabled Persons Scale (14) and the Dogmatism 
Scale, Form E (10), once by a person in a wheelchair and once by a person 
not in a wheelchair. Tape-recorded directions were used. Between-school and 
within-school results were considered in terms of correlations between the 
scales and ¢ tests between the administrations. 

The first hypothesis was supported: Students familiar with disabled persons 
reported a significantly more favorable attitude toward the disabled than did 
students unfamiliar with them. The second hypothesis was tentatively re- 
jected, although the results were in the expected direction; i.e., when a wheel- 
chaired administrator was present, Group F reported a more positive attitude 
toward the disabled. Group U reported a more negative attitude than when 
no "disabled" administrator was present. Differences between the two groups 
were considerably more significant for wheelchair administrations than for 
able-bodied administrations. Although no significant correlations were found 
between dogmatism and attitudes toward the disabled, Group F demonstrated 
a significantly less dogmatic, more open-minded attitude structure. 

These results were interpreted as being in general agreement with equilib- 
rium theorists as well as with Hebb. The results regarding dogmatism, while 
inconclusive, were not in agreement with Rokeach’s theory. 
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А. INTRODUCTION 


The purpose of this study is to demonstrate the importance of demand 
characteristics as а confounding factor in psychological experiments, partic- 
ularly in those experiments that deal with the “new look” in perception. 

It is not a newly discovered fact that in many experiments the subjects 
respond not only to the manipulation of the experimental variable but also 
to the implicit expectations of the experimenter (8). In general, the con- 
ception of § as a helpful, cue-seeking individual has been ignored by most Es 
who claimed immunity to this bugaboo through the inclusion of the omnip- 
otent “control group.” Recently, however, Orne (9, 10) has discussed the 
far-reaching effects of what he calls demand characteristics and has emphasized 
aracteristics cannot be adequately treated by the typical control 
and characteristics have been demonstrated in sen- 
an experiment on cognitive con- 


that such ch 
group. The effects of dem 
sory-deprivation experiments (6, 11) and in 
(15). These studies have demonstrated that Ss can pick up cues 


sistency e д 
asis alone, respond in a manner 


from the experimental situation and, on that b 
consistent with the experimental hypotheses being tested. 

Some experiments are more susceptible to demand-characteristic responses 
The Atkinson and McClelland study in perception (3) seems to 
be representative of those that are easily affected. In that study, Ss were 
restrained from eating for as long as 16 hours. They were then brought 
into the experimental room where they were asked to respond to slides 
(blanks) projected on a screen in terms of various kinds of questions (some 
of which were of the multiple-choice type) that included one food item as a 
possible choice. The results indicated that under periods of long deprivation, 
Ss reported more food and food-related items than under periods of short de- 
privation. Furthermore, Ss reported more food items when the questions were 


than others. 


* Received in the Editorial Office, Provincetown, Massachusetts, on June 15, 1964. 
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more tightly organized; i.e., the multiple-choice questions pulled more food 
responses than the projective questions. The authors accounted for this phe- 
nomenon in terms of the strength of the “hint” implicit in the question. Fol- 
lowing Orne's suggestion, we hypothesize that the “hint” existed not only 
in the form of the questions but throughout the entire experimental design. 
The responses were not only a result of Ss’ physiological condition but also 
of their motivation to help E in a situation involving clear expectations, Be- 
cause of this possible confounding the Atkinson and McClelland results 
remain equivocal. Further research in this area (7) has improved the research 
design; however the original design is an excellent model for our demon- 
stration. 
B. METHOD 


The main methodological difficulty in an experiment on demand charac- 
teristics is to change the demand characteristics without changing anything 


could be given. 
l. Subjects 


Ss were 39 male and female undergraduates at the University of California 
enrolled in their first psychology course. Participation in an experiment was 
a requirement of the course. 


2. Procedure 


breakfast on the morning of the experiment—in fact, to eat a great deal more 
than usual. (Apparently a self-selection procedure operated because no coeds 
on diets would sign up for the experiment.) When S arrived at 10 a.m. the 
next morning he entered a room filled with medical-looking odds and ends; 
a poster describing the intricacies of the Krebb's Cycle, books with titles 
such as The Hypothalmus and The Living Brain, a table set Up as a cot, and 
an E dressed in a white coat with a stethoscope dangling from her pocket. 
After the preliminary greeting, E gave $ a description of the experiment.? 


2 Copies of the complete description of the experiment given to Ss, the three re- 
lease forms, the medical questionnaire, the placebo questionnaire, the instructions for 
the slides, and the questions used for the slides may be obtained from the senior 
author. 
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The description indicated to $ that he was about to receive a new and unusual 
kind of pill that would put his body in a state of hunger, even though he 
might not actually feel hungry. This pill was said to be as unusual as the 
psychomimetic drugs in that it could cause drastic effects in a short period 
of time. The explanation was as follows: 


What we have for you today is not quite as dramatic as the hallucino- 
gens but just as unusual. As you know, your hunger center is not in your 
stomach but in your brain, in the ventromedial nuclei of the hypothalmus. 
When did you eat and what did you eat? Good. That means you now 
have a high blood-sugar level. That is a controlled part of this experi- 
ment. Now if I were to give you a pill that would reduce your blood-sugar 
level suddenly and drastically, the ventromedial nuclei would start to 
send hunger messages to your digestive system, and eventually you would 
feel hungry, your stomach would start to contract and secrete gastric 
juices. I have just such a pill. I am particularly interested in the period 
of time between the activation of the ventromedial nuclei and the actual 
activity of the digestive tract: that is, the period when your body is in 
a state of hunger but you do not yet feel hungry. That is much like the 
neurons firing in your brain. They are active, but you are unaware of 
them; so the experiment consists of showing you some slides, giving you 
the pill, and then showing you another set of slides. 


At this time $ was given the option of leaving the experiment right then 
while still receiving his credit as a participant, for Ё would use only an $ 
who voluntarily stayed and who would sign some release forms. 

The Ss who agreed to stay (and all but one did) read and signed three 
forms, The first released the University of California and all affiliated per- 
sonnel from any responsibility for S's welfare during the experiment; the 
second released $'s medical records to E. The third release form stated that 
the drug was harmless if $ had eaten within the last hour and a half and 

ar to the fact that he had eaten within that time. Next, $ 


required $ to swe 1 
was asked for a brief medical history that consisted of 26 questions of the 


following type: "Have you ever had or do you now have rheumatic fever, 
TB, etc." “When was the last time you were in a hospital and why?" “When 
was the last time you saw a doctor and why?" Shortly after the foregoing 
questionnaire, Ё took S's temperature and pulse. A placebo questionnaire, 
based on the one used by Abramson (1), was then given with the following 


introduction: 

I am interested in any slight changes that may occur within you due 
to the pill; therefore, I will ask you these questions now and then again 
after the pill. You may not be feeling exactly normal right now because 
you are in a strange situation with an unknown pill coming that you are 
not very sure of. (Most Ss smiled and looked embarrassed at this point.) 
I want you to answer these questions using "yes," "no," or “maybe” as 
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answers. Compare yourself now to your usual condition—let us say, 
as you were an hour ago. The “maybe” category is very important be- 
cause it is unlikely that there will be any great dramatic changes but 
only subtle ones; thus, please try to be introspective and sensitive about 
the changes. 


b. Control-drug condition. Ss in the control-drug condition were then 
shown the first set of slides. The directions for the slides were only a slight 
modification of those used by Atkinson and McClelland. These directions 
state that the purpose of the experiment is to measure visual acuity under 
different physiological conditions. The instructions also emphasize over and 
over again the difficulty of the task and the need for what seems to be pure 
guessing. We added the following statements to the instructions: 

There are two kinds of slides. One is a white slide with a thin line 
drawn on it. The other is a blotchy black-and-white slide with a thin line 
drawn on it. In most cases you can tell when you have a white slide and 
when you have a blotchy slide—but not all the time, That’s how difficult 
they are. The best thing to do is to read the question, look at the slide and 
write without too much thinking in between. It is not legitimate to write 
“white slide, blotchy slide, white light, etc" You must name an object. 


Next, E showed $ how the slide projector and attachments worked and 
three sample questions: a multiple-choice question, a projective question, and 
a loosely structured question. After making sure that $ had no questions about 
the clerical procedure the first set o£ slides was projected on the screen. When 
all 30 slides had been presented, the placebo was administered, 

S was given two yellow capsules from a bottle marked “For Experimental 
Use Only" and a glass of water. A stopwatch was started and S was asked 
to lie down on the table for 10 minutes, during which time he was to think 
only about himself. E called this "socially approved thinking about yourself." 
Again, E emphasized that $ should not expect any dramatic changes but should 
be sensitive and introspective enough to report any of the subtle changes. After 
five minutes, E took 8° pulse and said, "Uh-huh." E told $ when six minutes 
and nine minutes had passed. After 10 minutes § returned to his original seat 
and answered the same set of questions, this time with the instructions to com- 
pare his present feelings with his feelings before he took the pill. At the con- 
clusion of the questioning 8 was asked to describe his feelings in his own words 
or to mention anything not covered by the questions. § was then shown the 
second set of slides. An interview followed and was aimed at discovering $'s 
knowledge of the hypothesis seemingly being tested, the effect of the pill, and 
belief in the pill. [See "Table 1 for a detailed description of the steps in 
all four conditions.] 
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c. Drug-control condition. Subjects in the drug-control condition went 
through the same introductory phase but were given the pill immediately after 
the first questionnaire, rested for 10 minutes and then were shown the first set 
of slides. After that, they were given the following information: 

It would take 20 minutes for these pills to wear off, which is necessary 
before I can show you the second set of slides. But since you would 
probably like to be finished as soon as possible, I will give you this pill 
that will counteract the first and return you to your former physiological 
state. 


S was given one pink capsule and asked to lie down for another 10 minutes; 
then he was shown the second set of slides and interviewed. 

The true nature of the experiment was revealed to all Ss after the interview. 
АП expressed astonishment at the elaborate measures used in the study. None 
seemed angry or upset at being the target of such procedures. 

d. Pure-control group. The first control group was a pure-control group. 
These Ss did not undergo any of the elaborate procedures mentioned earlier. 
This group simply signed up for a one-hour experiment. There was no notice 
of food; so they came to the experiment unaware that it was concerned with 
food responses. $ took the placebo questionnaire, the preface being that he may 
be feeling unusual because he was in an experimental situation. § was shown 
a set of slides, asked to lie down for 10 minutes, given the questionnaire a 
second time, and then shown the second set of slides. 

This procedure gives a base rate for three measures: the number of food 
responses given to each set of slides; the difference in the number of food 
responses given at the first and second viewing; and the number of placebo 
changes that can be ascribed only to changes over time, a change common 
for all groups. 

e. Control no-pill group. We hypothesized that § would perceive more 
food items in the blank slides, according to the demand characteristics of the 
experiment; that is, over two sets of slides the food responses would increase if 
the expectations were in that direction and would remain the same if expecta- 
tions were at a minimum. Would food responses decline or remain level when 
this was the requirement of the demand characteristics? To answer this 
question, a second control group was tested. Ss in the control no-pill condition 
thought they were in the experimental condition and, in fact, were treated 
exactly as were Ss in the control-drug condition up until given the pill. At 
that time, instead of giving $ the pill, S was told that he was in a control 
condition and would not receive the drug; although all other procedures 
would be exactly the same as in the experimental conditions. 

Thus we would expect no change in the number of food responses given to 
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the two sets of slides, for, at the second viewing, S’s knowledge that he was 
part of a control condition would suppress the number of food responses. In 
this hypothesis we agree with Orne (10), who says that every college sopho- 
more knows that control conditions are not expected to change their responses. 

In summary, through the use of a placebo we have arranged two experi- 
mental and two control groups. In the control-drug condition we hypothesize 
that Ss will report more food responses to the second set of slides: that is, 
under the influence of the placebo. In the drug-control condition we hypoth- 
esize that Ss will report fewer food responses to the second set of slides: that 
is, without the influence of the placebo. In the two control conditions (control 
no-pill condition and pure-control condition), we expect that Ss will report 
the same number of food responses to both sets of slides. 


C. MATERIALS Usrp 


Each $ saw a total of 60 slides, 30 under each condition. These slides were 
bordered with yellow or red tape, so that $ would be aware (if he was inter- 
ested) that he was being shown two different sets of slides. Within each set of 
30 slides, half were blanks and half were spotted with black smudges. These 
latter slides looked much as if ink had been dropped randomly over a white 
sheet of paper, but the blots were not symmetrical. Each $ was shown two 
spotted slides and then two blank slides continuing in an AABBAA order. 
Because the spotted slides differed from each other considerably, the order of 
presentation within a set was kept constant over all Ss. Using a slide projector 
with a Veriac attachment, set at 50 volts and the shutter at .01 second, 
the slides were projected on a screen about 15 feet away from the S. Each slide 
was shown twice at the .01 speed. 

'There were two sets of questions: Set А and Set B. Each question was 
written on an index card for S’s convenience in keeping his place, and the $ 
wrote his answer on a mimeographed answer sheet. In each set of questions 
there were 10 multiple-choice questions, 10 loosely structured questions, and 
10 projective questions. Each question could be answered with a food response. 
Examples of the questions are as follows: (a) a multiple-choice question—1) 
Lemon, sun, Christmas-tree ornament, bracelet. 2) Test tube, mirror, per- 
fume, glass of wine. (2) a loosely structured question—1) A woman. What 
is she doing? 2) A holiday scene. What is happening? (c) a projective ques- 
tion—1) What is it? (All projective questions were the same.) 

Within each set of questions a fixed random order was used. All possible 
combinations of slides and set of questions were used, each $ being assigned 


randomly to any one combination. 
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D. Resutts AND Discussion 


The placebo effect is one of the most striking results of the experiment and 
strong evidence for the successful manipulation of the independent variable 
in at least one of the experimental conditions [see Table 2]. The control-drug 
condition shows significantly more physical changes as reported on the ques- 
tionnaire than any of the other conditions. The comparison of this control- 


TABLE 2 

MEAN NUMBER ОЕ PLACEBO CHANGES IN EACH CoNpiTION* 

Condition Mean 

Control-drug 10.5 

Drug-control 7.0 

Control no-pill 5.5 

Pure control 4.7 
* £ between control-drug condition and drug-control condition — 2:5; # = 01: 
t between control-drug condition and control no-pill condition — 2.63, p = .02; and 
t between control-drug condition and pure-control condition = 4.22, p = .01. No sig- 


nificant differences between the three last conditions. 


drug condition with the two control groups that did not receive pills of 
any kind allow us to allocate the reported physical changes to the demand 
characteristics of the placebo rather than to the passage of time, to the practice 
with the questionnaire, or to the 10-minute rest period on the hard table 
(as many subjects tried to make E believe). 

The drug-control condition, however, showed no more placebo changes than 
did the two control groups and significantly less (5 < .02) than the control- 
drug condition; thus it seems that the manipulation was not successful in this 
order of drug presentation. 

A rating made at the conclusion of the experiment offers data for the fore- 
going hypothesis. Ss were asked to rate their belief in the pill as that belief 
existed when they were resting immediately after taking the pill (1 = no 
belief, 7 = belief). No doubt this is a difficult rating to make, for it involves 
the recollection of a previous state of belief, and Ss may become suspicious 
about the pill due simply to the fact that the question was asked. Never- 
theless, the rapport between Ss and E was unusually good at that time; and 
we believe that the answers are essentially honest though biased in the up- 
wards direction. Almost one third of the Ss used the rank of seven, the highest 
rank possible for indicating belief in the pill. Twelve out of 17 Ss ranked 
their belief as five or higher. Only two Ss used the rank of one, and both of 
these were in the drug-control condition. When one looks at the ranking made 
by the control-drug Ss versus those made by the drug-control Ss, he sees that 
the mean ranking for control-drug Ss is six; while that for drug-control Ss 


=. 


э 
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is only 4.14. The difference is significant at the .01 level. Note should also 
be taken of the awkwardness of the manipulation of the placebo variable in 
the drug-control condition: in particular, of the fact that it was necessary to 
give Ss two pills—one to make them hungry; another to return them to the 
“full” state. Therefore, any risk involved in insuring the credibility of the 
placebo is doubled under this condition. On the basis of nonbelief in the pill 
and awkwardness of the experimental procedure, we conclude that the drug- 
control condition is an inadequate test of the hypothesis because the situation 
was not credible to Ss. 

In the control-drug condition, Ss showed significantly more food responses 
under the drug than under the no-drug conditions (p = .025, one-tail Walsh 
test) ; thus our main hypothesis is confirmed. Demand characteristics can in- 
fluence S’s perception of ambiguous stimuli. No other conditions showed a 
significant change in the number of food responses over the two viewings of 
the slides; rather, the number of food responses remained constant within the 


three other conditions [see Table 3]. 


TABLE 3 
Mean NUMBER оғ Foon Responses GIVEN BY EACH CoNDITION 
TO EacH SET or SLIDES 


Condition Slide Set 1 Slide Set 2 pe 
Control-drug 3.54 5.82 .025 
Drug-control 43 m n.s. 
Control no-pill 3.14 25 us 
Pure control 4.85 2 "m 


* All tests are one-tailed, The test used is the nonparametric Walsh test (14). 


As expected, the drug-control condition appears to be more like a control 
condition than like an experimental condition. One possible reason for this 
is the failure of the manipulation of the independent variable. If the pill 
had worked adequately we would expect that in Ss’ first viewing of the slides, 
which occurred immediately after taking the pill, Ss would respond with 
a larger number of food items than would those who had not yet taken the 
pill and, therefore, were still “full.” However, there was no difference be- 
tween conditions on the number of food responses given to the first slide set. 

The control no-pill group showed overall the fewest number of food re- 
little change from the first to the second set of slides. On the 
first set of slides, when they are under the same condition as the control-drug 
group they show virtually the same number of food responses (3.14 for the 
control no-pill group vs. 3.54 for the control-drug group) ; however, when 
they find out that they are in the control condition they do not alter their 


sponses and very 
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level of food responses for the second set of slides. The difference between 
the control no-pill group and the control-drug group on the second set of 
slides does not reach the .05 level, but the authors contend that ingestion of 
the pill and acquiescence to the expectations associated with the pill directly 
affected perception in the control-drug condition.? 


The pure-control group follows the Same pattern of no change in the 
number of food responses given to the two sets of slides. Interestingly, there 
is a significant difference between the control no-pill group and the pure- 
control group on the number of food responses given to the first set of slides 
(5 = .05). If one assumes that the Pure-control group is the baseline group 
(Ss in that group have no knowledge of food, hunger, or pills) while Ss in 
the control no-pill group are food satiated, then the difference in number of 


The postexperimental interview had two purposes: to check the credibility 


of the placebo manipulation and to estimate Ss’ awareness of the demand 
characteristics. 


I just feel that I can’t see as sharply. The sort of thing as when you're 
tired, but I don't feel tired. I feel like you feel before you get hungry. 
Just feel like I'm not focusing. My attention span is lapsing. 

As I was lying down I could hear my heart beat and feel my mouth 
and lips become dry and my head hurt. My right afm got cold and 
heavy. Can't explain the sensation in my hands, just in my hands, tingling 
sort of. More aware of noises in the room. Doesn't really seem like a 


8 We had no way of measuring the actual physiological state of hunger of each S. 
However since all $s in three of the conditions came to the experiment in a satiated 
state, we assumed that any change in appetite due to a discussion of food would be 
distributed randomly over all conditions. 
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dream but doesn’t seem real either. My hands feel like they are shaking 
though I know they aren't. 


As in the second description, many Ss complained of a headache because of 
the 10 minutes spent lying on the hard table; yet not one $ in the control 
groups ever attributed a headache to the table. 

In general, the placebo reactions were vague and hard to describe, usually 
centering aound feelings of pleasantness, calm, and relaxation. Specific 
changes, such as moist palms and numb lips, were always similar in topic to 
the questions that previously had been asked on the first questionnaire. No 
one ever reported tingling scalp or butterflies in the stomach—topics not 
touched upon by the questionnaire. Virtually all of the questions in the ques- 
tionnaire were responded to positively by at least one subject. Only six out 
of the 46 questions never showed a placebo response. These six questions 
refer to serious physiological changes, such as feeling ill and abnormal hearing. 
The most frequent questions to show an effect were those that centered 
around autonomic-nervous-system reactions pertaining to increased salivation, 
feeling anxious, and feeling drowsy. Nine or more $$ responded positively to 
these questions. 

During the interview each $ was asked, “Were there more questions 
pertaining to food on the first set of slides, the second set, or were they about 
the same?" Regardless of whether the pill was given on the first or second 
set of slides, 13 out of 17 Ss who were asked reported that there were more 
food questions under the drug condition. There were only three reversals 
and one report of equal. Certainly, then, Ss had food on their mind during 


the drug period. 
Another question in the experiment was, “Now you tell me what do you 


think is going on in this experiment? What is being tested?” If 8 showed 
signs of repeating the description previously given by E, $ was urged to 
thought was being tested. Only one $ out of 18 was 


report what he really г 
purported purpose of the experiment. Some typical 


unable to verbalize the 
answers are as follows: 


I knew my mind was supposed to 
pill would have an effect on my bein 


would not have seen so much food. | Pa 
You are trying to find out if by suggesting something it would make 


me change my perception so that I will see food. 
If I had been hungry I would choose appealing food w 


card. If I wasn’t I would not. 


Thus all Ss realized that their physiological state was associated with what 


be on food. You told me that the 
g hungry. If you hadn't said that I 


ords on the 
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they saw on the screen and that if they were hungry they were supposed to 
see more food items. Note that this was never mentioned in the instructions 
or introduction to the experiment. 


The demand characteristics of the experiment can never be completely 
eliminated (10), but investigators must not confuse responses to these de- 
mands with responses to the independent variable, 
control-drug condition produced results such that if 


Periment were unknown they would seem to uphold the Atkinson and 
McClelland results (3), yet it is clear that the results obtained are 
of the experimental design. Even before Ss entered 


In this experiment, the 
the context of the ex- 


artifacts 


аге no cues 


two sets of slides because t expectation 
that they should do so. The -€ the control-drue group), 
on the other hand, had л iological Teason to change Фан the 
control groups, but the expectation of the situation was Strongly in favor of 
change, and $5 in the group responded to this expectation, 

College sophomores or 


any Ss in an e 
especially if they feel that the experiment is Personally important to E. At the 
same time, Ss try to assign a meaning to their experi | 
сап be satisfied by followin 
according to these cues, 
cepted by Ё as confirmation of his hy; 


We do not mean to imply that all experimental results are the products 

U "i TM M " 
of E's implicit wishes, but simply that all experimental designs should be 
carefully searched for such pitfalls, Perhaps Perception is influenced by 


physiological need and, indeed, it might be; but in the plethora of 
relevant studies we cannot be su gical needs inf e 

В s c 
perception or the need to please E ; idm 


Xperiment are pleased to help E, 


act 


ten quickly and gladly ac- 
potheses, 
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MEASUREMENT OF RADICALISM-CONSERVATISM* 


Department of Psychology, University of California, Los Angeles 


ANDREW L. Comrey AND Јонх A. NEWMEYER 


A. INTRODUCTION 


‘The isolation of a radicalism-conservatism attitude dimension by factor 
analytic methods seems to have been reported first by "Thurstone (17). Since 
that time, other investigators (1, 6, 7, 10, 14, 15, 16) have found similar 
factors. The work of still other investigators leaves some doubt about how 
general such a factor is or whether or not it is best considered to be a com- 
posite of substantially independent variables (8, 9, 13). Most of these studies 
were carried out at a time when computing facilities placed a severe limitation 
on the number of variables that could be investigated. This fact, together with 
the widely varying analytic methods employed and divergent results from 
previous studies, makes apparent the need for additional research in this area. 

The present investigation represents an attempt to apply the factored ho- 
mogeneous-item-dimension approach (2, 4, 5) to the structuring of political 
and social attitudes. "The essence of this approach is embodied in two prin- 
ciples: (а) knowledge of the factor structure in a domain should be derived 
from factor analysis of total scores over groups of homogeneous items rather 
than from factor analysis of single items and (5) groups of items that pro- 
vide total scores that are alternate forms of cach other should be combined 
into one score rather than included as separate variables in the factor analysis. 

Application of the first principle tends to increase the reliability of the 
variables used; hence favors greater stability of factor results. Application of 
the second principle avoids the distortion of factor results by overlapping spe- 
cific variance. Such distortion is common in the factor analysis of single items 
because it frequently happens that several pairs of similar items appear in the 
collection to be analyzed. Within such item subgroups of two or even more 
items there may be substantial item intercorrelations that tend to dominate 
the factor solution. 

B. PROCEDURE 


1. The Sample 
Two hundred and twelve volunteers participated in the study. These indi- 
viduals were recruited by soliciting university-student volunteers; members of 
UE 


s Received in the Editorial Office, Provincetown, Massachusetts, on June 17, 1964. 
Copyright, 1965, by The Journal Press. 
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Los Angeles social, religious, and fraternal organizations; friends; and vol- 
unteers from a form distributed door-to-door in randomly selected blocks in 
Los Angeles. All participants were promised an analysis of their questionnaire 
results in return for their cooperation. The subjects may be described in the 
following terms: Forty-one were under 21, 42 were aged 21 to 24, 25 were 
25 to 29, 41 were 30 to 39, 37 were 40 to 54, and 26 were over 54. One 
hundred and forty of the subjects were male; 72, female. Sixteen subjects 
had not finished high school, 19 were high-school graduates, 72 had some 
college work, 57 were college graduates, and 48 had some graduate work. 
Five subjects described themselves as conservative Republicans, 74 as Repub- 


licans, 32 as Independents or as having no political preference, 80 as Demo- 
crats, and 12 as liberal Democrats or Socialists, 


2. The Questionnaire 


Thirty political and social-attitude var 
a review of previous work and some subje 
to cover major areas without identific. 


iables were selected for study after 
ctive analysis. An attempt was made 


ation with specific issues apt soon to be 
outdated. An attempt was made to write four homogeneous items to measure 


each of the 30 attitude variables. Each item was to be answered by a selection 
from the following scale: 1) Agree Very Strongly, 2) Agree Strongly, 3) 
Agree, 4) Agree Somewhat, 5) Neither Agree nor Disagree, 6) Disagree 


Somewhat, 7) Disagree, 8) Disagree Strongly, 9) Disagree Very Strongly. 
. The 120 items were arranged in a booklet—one item for each dimension in 
the first 30 items, a second i i i 


in the second 30 items, and 


Я е numbers were treated as 
scores, 


Scores for the 120 items were intercorrelated and factor a 
analysis purposes. Factors Were extracted by the minimum-residual method (3), 
and rotated orthogonally by the normal-varimax criterion (12). Inspection 
of the factor results and the intercorrelations among items reveals the fol- 
lowing facts: 


nalyzed for item- 


: : +60; and inter- 
correlations across item groups w i 

fore, the items for these two gro 
Religiosity. 


—— 
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2. Items for two variables originally called Pacifism and Militarism also 
combined to form the same item factor, albeit at opposite ends of the con- 
tinuum. Seven items of the original eight items had loadings of .68 or more 
on this item factor; hence were combined to form one dimension: Pacifism. 
The militarism items were reversed in the scoring, of course. 


3. Items for the variables Weak Federal Government and Pro-Free- 
Enterprise defined a single item factor, all eight items having loadings of .46 
or more. These items were combined to measure one dimension: Weak Fed- 
eral Government. 

4. Items for the variables Censorship and Restrictions on Sexual Freedom 
were combined for the dimension Moral Censorship, although the variance 
(for all items) for these two variables divided between two item factors. 


5. The variable Birth Control split into two distinct item factors (two 
items each): Population Control and Contraception. 


6. As expected, several other variables showed up as item factors and were 
retained as dimensions; although in many cases less than four items proved 
to have factor loadings high enough for the items to be retained. These di- 
mensions were Antiunionism, Pay Based on Ability, Partisanship, World Gov- 
ernment, Fatalism, Belief in People, and Education for Adjustment. 


7. Several other variables failed to define separate item factors but ap- 
peared on item factors with other groups of items. Inspection of item factor 
loadings, patterns of interitem correlations, and item content, however, failed 
to justify pooling these items with others but suggested their retention as 
separate variables. These dimensions were Welfarism, Racial Tolerance, 
Severe Treatment of Criminals, Service to Country, Rapid Social Change, 
Traditional Moral Values, Fascism, Respect for Age, Service to the Individ- 
ual, Indoctrination, and Cultural Ethnocentrism. 


Twenty-nine variables were selected for the main factor analysis. Twenty- 
five of these were total scores over groups of two or more homogencous items. 
The last four variables consisted of single biographical items. The names of 
the variables and the items used to measure them are presented later. After 
each variable name, the mean, standard deviation, and a reliability estimate 
for that variable is given. The reliability estimates were obtained by comput- 
ing the average interitem correlation to obtain the reliability of a test of the 
appropriate number of items. The A and B and the number following certain 
items represent the form to which the item has been assigned and the factor 
loading for the item. This will be explained further later on. 
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С. ITEMS 
l. Religiosity (47.8, 15.3, 94) 


Every child should have religious instruction (A, .70). 

God exists, in the form in which the Bible describes him (A, .67). 

This country would be better off if religion had a greater influence in 
daily life (A, .62). 

School teachers should believe in God (В, .73). 

It should be against the law to do anything which the Bible says is 
wrong (B, .59). 

Moses got the ten commandments directly from God (B, .66). 

АП people alive today are the descendants of Adam and Eve (A, .53). 

АП the miracles described in the Bible really happened (B, .59). 


2. Pacifism (45.2, 13.5, 93) 
Under no circumstance У use nuclear bombs against 
anybody (B, —.56). 


I would rather have a foreign power take over 
another world war to stop it (B, — 79). 

This country shoul, 
do (A, —66). 

If my Country had b 


d disarm regardless of whether Ог not other countries 


r our allies, 
Our country should be engaged constantly i 


The average man today is getting less than his rightful share of our 
National wealth (A, —.52), 


The government 5 


ent to take care of people who 
can 
If the government must go deeper in debt to help People, it should do 
so (В, —.66). 
4. Anti- 


Unionism (20.6, 3:8: 77) 


officers with crimin 


> 


ES 
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5. IFeak Federal Government (36.4, 13.8, .92) 


Central government should run only those things which cannot be run 
effectively at the local level (A, .52). 

The federal government has too much power over citizens and local 
government (A, .62). 

Greater decentralization of power would be better for this country 
(A, .64). 

The federal government should not interfere in the affairs of individual 
states unless absolutely necessary (B, .66). 

The strength of this country today is largely a product of the free 
enterprise system (B, .68). 

A greater degree of government control over business would result in 
à weakening of this country's economy (A, 73). 

Regulation of business by government usually does more harm than 
good (B, .65). 

When something is run by the government, it is apt to be inefficient and 
wasteful (B, 47). 


6. Moral Censorship (55.3, 13.1, .90) 

Motion pictures which offend any sizeable religious group should be 
banned (B, .49). 

If a man is showing a sex movie to friends in his own home, the police 
should stop it (A, 43). 

Every city should prevent the sale of objectionable books (A, .68). 

Public libraries should contain only books which are morally sound 
(B, .62). 

Sexual relations between unmarried people should be illegal (A, 60). 

The police should hunt down homosexuals and put them in jail (A, 
:61). 

А woman who has sexual relations with a man for money should go 
to jail (B, .58). 

More restrictions should be imposed to prevent young people from 
having sexual relations before marriage (B, .68). 


7. Population Control (5.3, 3.0, .63) 


In overpopulated countries the people should be taught to practice 
birth control. 
Uncontrolled population growth is a dangerous menace. 


8. Contraception (5.8, 3.6, .62) 


Abortion should be legalized (A, —.47). 
Birth control devices should be made readily available to anyone who 
wants to use them (B, —46). 


9. Racial Tolerance (13.4, 5.6, .68) 


Marriages between persons of different races should be socially accept- 
able (B, —.67), 
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Employers should be Prevented by law from hiring only people of their 
own race (A, —.39). 


Social organizations limiting membership to a Particular race should 
be abolished. 


10. Severe Treatment of Criminals (21.7, 6.4, 82) 


In our country, 
light (A, .66). 


11. Pay Based on Ability (EE, 3:8; 62) 
A well educated 


Person should have a higher income than one who is 
less well educated, 


12. Capital Punishment (20.3, 8.1, 86) 

A mentally ill man 
executed (A, 57). 
A dictator who 


who attacks and kills a little girl should be 
Orders the extermination of thousands o 


€cuted for his crimes (B, .57). 
A gunman who ki 


robbery should receive the 
death Sentence (A, .69). 
Someone who 


f innocent 


plans and carrie: 


S out the murder of his or 
should be executed (B, 71). 


18, Partisanship (26.1, 4.6, 68) 

The political Party which I fay, 
country than any other prese 

I would not support a Political Candidate of the Major Oppositio; 
to my own. 

Leaders of the major political Party opposite to my own party are 
generally cynical, selfish men who disregard the national welfare, 

I am inclined to dislike anyone who can accept the view of th 
political party opposite to my own. 


or is unquestionably better 


for the 
nt party, 


n party 


e major 


14. Service to Country (18.0, 7.6, 85) 


Every able bodied male should willingly serye for a period of time in 
his country’s military service (A, .70), 

If called upon to do so, a citizen should be willing to Sacrifice his life 
for his country (B, .59), 


— Pop. 
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Patriotism is one of the great virtues (B, .74). 
A man who is ready to die for his country deserves the highest honor 
(A, .62). 


15. Rapid Social Change (15.7, 4.7, 65) 


Where social injustice exists, it must be changed immediately regardless 
of the consequences. 


People who are not willing to give others equal rights should be re- 
quired by law to do so. 


People who discriminate against others because of race or religion 
should be treated like criminals. 


16. World Government (19.8, 9.0, 90) 

The United States should work peacefully for a strong world govern- 
ment (A, —.55). 

The United States should be willing to surrender some of its rights 
to strengthen the United Nations (A, —.80). 

The United States eventually should give up its military power to a 
strong world government (B, —74). 

Present nations should become states within an all powerful world 
government (B, —.61). 


17. Fatalism (15.2, 2.8, .67) 


Man cannot alter his destiny. 
Each person's life is laid out for him in advance. 


18. Traditional Moral Values (6.0, 2.8, .52) 


People should work hard and pay their debts on time. 
A man should always be faithful to his wife. 


19. Fascism (11.8, 3.3, .52) 


A nation will generally progress more rapidly under the guidance of 
a single strong, benevolent leader. 

Representative bodies like Congress hav 
divisions to function effectively. 


20. Respect for Age (102, 3.1, 63) 


As people get older, they usually become wiser. 
Older people usually have a more intelligent view of life than younger 
people. 


* too many arguments and 


21. Service to the Individual (10.3, 3.6, .73) 
Laws which benefit the people are more important than laws which 
Strengthen the nation (A, —.63). 


The welfare of the individual is more important than the welfare of 
the country (B, —48). 
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22. Indoctrination (18.3, 6.6, 77) 

Children should be taught early that our country 

in the world. f 

There are things a child must be taught to believ 
to understand them. 

Moral values should be instilled in children whil 


Children should be taught to obey their paren 
argument. 


is the greatest country 
e before he is ready 


e they are very young. 
ts without question or 


23. Belief in People (13.0, 3.9, .68) 

The average person, if suddenl 

try to do an honest and hardwor 

Only a few people would deli 
from the government. 

Most people are basically honest. 


у handed a government position, would 
king job. 


berately lie to get welfare support money 


24. Cultural Ethnocentrism (29.3, 4.4, .73) 

Cultural exchanges with hostil 

The people of this nation ha 
nations. 

Americans should 
foreign imports, 

American students should be shielded from foreign ideologies and 
philosophies until they are adults. 


€ nations are of little benefit to us. 
ve little to learn from peoples of other 


Patronize American art and music rather than 


25. Education for Adjustment (11.0, 3.8 


Schools should spend more time teaching children how 
each other, 


y) 


High school education should be i ping the 
student make a good social adjustment, 


26. Age (2.3, 1.7, —) 
Below 21 was scored as 0; 21-24, 1; 25-29, 2; 30-39, 3; 40-54, ^; and 
over 54, 5, 


27. Sex (0.4, 0.5, —) 


Males were scored 0; females were scored 1, 


28. Educational Level (2.5, 1.2,—) 
Less than high school graduation was scored 0; high school graduate, 
1; some college, 2; college graduate, 


3; some graduate work, 4, 


29. Political Preference (2:1, 20; 


Conservative Republicans were scored 0; Republican, Li 
or no preference, 2; Democrat, 3; 


о Independent 
liberal Democrat or Soci 


ialist, 4, 
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D. THE ANALysiIs 


A total score was computed for each of the 212 subjects over each of the 
25 item groups by adding the numbers of the responses selected by the sub- 
ject for those items. In the cases of Pacifism, the scores for three of the items 
had to be subtracted from 10 before adding to maintain a uniform direction 
in the items scored because these three items measured the opposite end of the 
continuum. These 25 scores, plus scores for age, sex, educational level, and 
political preference were intercorrelated (Pearson r). This 29-by-29 corre- 
lation matrix was factor analyzed by the minimum-residual method (3), and 
nine factors were extracted. This method is similar to the principal-compo- 
nents method of factor analysis except that no communality estimates are 
used or needed. Convergence could not be obtained for a tenth factor. 

The nine factors were rotated orthogonally by the normal-varimax cri- 
terion (12). These orthogonal factors were further rotated obliquely using 
simple structure and psychological meaningfulness as criteria to guide the ro- 
tations. The reference-vector projections and the correlations among reference 
vectors were utilized to obtain the table of correlations among the factors.! 
The table of correlations among the factors was analyzed by the principal- 
factor method, using unities in the diagonals, to provide knowledge of the 
second-order factor domain. 


E. RESULTS 
Five of the nine factors had at least two factor loadings over .60. These 
factors will be presented along with the variables that had factor loadings of 
.30 or more. 
1. Factor II. Welfare-State Attitudes 


3. Welfarism, —.82; 29. Political Preference, .78; 5. Weak Federal 
Government, .68; 4. AntiUnionism, .50; 23. Worthiness of People, —.42; 
15. Rapid Social Change, —.39; and 21. Service to the Individual, —.32. 


2. Factor III. Punitive Attitudes 


10. Severe Treatment of Criminals, —.70; 12. Capital Punishment, 
—.66; 6. Moral Censorship, —.47; 7. Population Control, —.46; 19. Fasc- 


1 The computations were carried out on the IBM 7090, operated by the UCLA 
Computing facility. The following items have been deposited with the ADI: the 
matrix of correlations among dimensions, the minimum residual factor matrix, the 
varimax factor matrix, the oblique reference vector projection matrix and the trans- 
formation matrix to get it from the varimax matrix, the oblique factor pattern, and 
the oblique factor structure. Order Document No. 8507 from the ADI Auxiliary 
Publications Project, Photoduplication Service, Library of Congress, Washington 25, 

C. remitting in advance $1.25 for 35 mm microfilm or $1.25 for photocopies. Make 
check payable to Chief, Photoduplication Service, Library of Congress. 


366 JOURNAL OF SOCIAL PSYCHOLOGY 


ism, —.37; 20. Respect for Age, .37; and 24. Cultural Ethnocentricism, 
Я мы] Ы 
—4з3. 


3. Factor IV. Nationalism 


14. Service to Country, —.83; 2. Pacifism, -82; 16. World Government, 
+66; 7. Population Control, 49; 21. Service to the Individual, 48; 22. In- 
doctrination, —43; 19. Fascism, .31; and Age, .30. 


4. Factor VII. Religious Attitudes 


1. General Religiosity, —.69; 8. Contraception, 76; 6. Moral Censor- 
ship, —.65; 7. Population Control, 


44; 24. Cultural Ethnocentricism, 
— 55; and 17. Fatalism, —32, 


5. Factor 1Х. Racial Tolerance Attitudes 


15. Rapid Social Change, —.70; Racial Tolerance, —.65; and 25. 
Education for Adjustment, —.31, 
The negative loadings for the attitude variables are negative when consis- 


tent with the factor-title direction due to the scoring of “1” for Agree Very 
Strongly and “9” for Disagree Very Strongly. 


The remaining four factors are of lesser importance, Factor I had loadings 


5 20, 13, and 27, respectively, Factor V had 


TABLE 1 
CORRELATIONS AMONG Facrons* 
Fact 
Factor 1 2 3 4 "s ы 6 7 8 3 

1 
2 —10 
3 32 —56 
4 00 69 —7 
5 —05 36 —29 35 
6 16  —35 їп —45 ig 
7 04 —48 46 —60 —23 26 
H 02  —32 ё —48 — . 28 12 29 
9 16 55 —43 52 22 —10 —31 —23 
* See p. 358 for explanation of factors, 


CS 


| 
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factor resulted, accounting for 42 per cent of the total variance. Loadings for 
the five most-important primary factors on this second order factor were as 
follows: II. Welfare State Attitudes, .81; III. Punitive Attitudes, —.82; 
IV. Nationalism, —.90; VII. Religious Attitudes, —.68; and IX. Racial 
"Tolerance Attitudes, .62. Loadings for factors I, V, VI, and VIII were —.14, 
49, —.51, and .56 respectively. This second-order factor is considered to 
represent the radicalism-conservatism attitude continuum, At the radical end 
of the continuum we have belief in the welfare state, powerful federal goy- 
ernment, pacifism, world government, racial tolerance, and rapid social 
change. On the conservative end of the continuum, we have belief in severe 
treatment of criminals, capital punishment, religion, moral censorship, and 
service to country. 

Dimensions 1 through 10, 12, 14, 15, 16, and 21 had substantial loadings 
on the five primary factors that best defined the second-order factor of radi- 
calism-conservatism. These 15 dimensions were measured by a total of 67 
items. To select items for measuring radicalism-conservatism, these 67 items 
were intercorrelated (by Pearson r), and a single minimum-residual factor 
extracted. Thirty pairs of matched items were selected from the total of 67 
items to provide two forms, A and B, of a radicalism-conservatism test. The 
number of items taken from any given dimension was the same in both forms. 
Items in the two forms were also matched for size of loadings on the single 
general factor in the analysis of 67 items. Those items selected for Form A 
are those which have an “A” in parentheses following the item as shown 
above. The factor loading on the general factor from the 67-item analysis 
follows the A. Items for Form B and their factor loadings are similarly in- 
dicated. If the same scale of response is used as that employed here, scores on 
the items stated in the radicalism direction must be reversed by subtracting 
each such item score from 10. This is required for cach item that has а nega- 
tive factor loading after the letter A or B above. Adding up item scores, in- 
cluding the reversed item scores, for the 30 items in Form A and the 30 items 
in Form B will give two measures of radicalism. The low scores indicate con- 
servatism. 

In the present sample, for Form A, the mean and standard deviation, re- 
spectively, were 159.0 and 41.3. For Form B, they were 158.4 and 41.8 re- 
spectively. For the total score, Form A plus Form B, the corresponding sta- 
tistics were 317.4 and 82.3. The alternate-forms reliability estimate is -96, 
correlating Form A and Form B. This corrects to .98 for the total score, 
although these figures can be expected to drop with a new sample. The dis- 
tributions are approximately normal in this sample. 
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F. Discussion 


This investigation helps to resolve some of the differences that have ap- 
peared between findings in Great Britain and this country. Ferguson (8, 9), 
for example, structured the domain in terms of three primary social attitudes, 
Religionism, Humanitarianism, and Nationalism, with no Radicalism-Con- 
servatism factor. His three primary social attitudes correspond closely to three 
of the primary factors obtained here, Religious Attitudes, Punitive Attitudes, 


and Nationalism respectively. The Radicalism-Conservatism factor is to be 
found in the substantial correlations between his thre 


e primary social atti- 
tudes. In fact, 


because of this correlation, Ferguson found it necessary to 
analyze two attitudes at a time rather than all three together. 


British authors, on the other hand, have typically extracted few factors 
and have more often than not left them unrotated. If there is a general factor, 
as in the present case, it will appear at this fi 


rst unrotated factor. Thus, the 
British authors have identified Radicalism-Conservatism as а first unrotated 
factor. 

Thus, rotation of first- 
tions approximating simp 
around several distinct у; 
tarianism, and so on) w 
variables factor analyze 
tism as the first genera 
related primary factor. 


order factors to either orthogonal or oblique posi- 
le structure tends to organize the attitude domain 
ariables (such as religionism, nationalism, 


ith no Radicalism-Conservatism factor. Th 
d without rotation will 


humani- 
ese same 
show Radicalism-Conserva- 
eveal appropriately the cor- 
adicalism-Conservatism. The 
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THE LANGUAGE OF POSTURAL INTERPRETATION* 12 


Department of Psychology, Wesleyan University 


Rosert Н. Knapp 


A. INTRODUCTION 


In earlier papers (1, 2, 4), the author has discussed the “diadic silhouette.” 
One of those papers reports the distinctive manner in which male schizo- 
phrenics are prone to interpret such stimuli. Another deals with how the 
judged acceptance of one figure by the other is influenced by alterations in 
posture. A third describes the manner in which different types of inter- 
personal transactions are ascribed to different versions of the diadic silhouette. 
In the present article, the author reports data roughly parallel to those in 
the third of the previous articles. This article differs from the earlier one 
in that it presents a level of generality not reported heretofore, it deals with 
a new and improved series of stimuli, and it calls for a pattern of responses 
altered from that presented in the earlier studies. 

In the Diadic Silhouette Test, two profile human figures (male and female) 
are presented in 27 postural variations. Nine of the postural variations involve 
a condition in which the female is at a higher elevation than the male, nine 
in which the two figures are at equal levels, and nine in which the male 
figure is higher. Further variation is introduced by rotating the male and 
female figures from backward to upright to forward positions. The com- 
binations of three elevation patterns with the three rotational positions for 
each figure yield 27 silhouette variations. A departure from the stimuli used 
in earlier studies (1, 2) lies in the fact that the figures in the present instance 
were mature and not juvenile, and three rotations of the figure rather than 
four were employed. 

The subjects were required to view each diadic silhouette and to describe 
the nature of the transaction or exchange occurring between the two figures. 
For that purpose they were provided with a “menu” consisting of 16 items, 
They were asked to specify which of the two figures was initiating the ex- 
change, whether the “act” involved was “telling,” “asking,” “demanding,” or 
“offering,” and whether the other party was responding affirmatively or nega- 


* Received in the Editorial Office, Provincetown, Massachusetts, on June 26, 1964, 
Copyright, 1965, by The Journal Press. 

1 Support for this research was obtained in part from the National Institutes of 
Mental Health under Contract M-2178 to Wesleyan University. 
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tively to the initiated action. In this study the "menu" was identical with 
that in the previous studies except that the act "demanding" replaced “com- 
manding." It should be observed in Passing that each Subject was free to 
employ each of the 16 menu items as often as he felt it appropriate. 


B. METHOD AND PROCEDURE 


discussed. One should note, howeve 
silhouettes twice, once in its blue an 
responses to each silhouette. 

Just as the color of the b 
determinant of the interpretation placed on th 


r, that each individual rated the 27 
d again in its red form, thus yielding two 


based upon the revised 
- Tables 1 to 6 show how 
ed to the 27 diad variants. 
houette, the male figure is 


ted chi square with 26 de- 
ull hypothesis that a 


243 exceeds the .01 level 
There is a fairly incisive р 
appears that the male is seen 


Yr 
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| TABLE 1 
| THE INCIDENCES ОЕ RESPONSES INVOLVING THE MALE INITIATOR 
| FOR EACH OF THE 27 Diapic VARIANTS* 
| Male 
| Elevation 1 Elevation 2 Elevation 3 
(Female higher) (Equal) (Male higher) 
Back- Up- For- Back- Up- For- Back- Up- For- 
Female ward right ward ward right ward ward right ward 
Backward 103 80 70 78 99 84 37 71 107 
Upright 65 56 44 70 103 68 71 86 129 
Forward 56 34 39 64 96 99 83 100 140 


* x? = 243.3, Each subject contributed two responses to each silhouette: first, when 
the background was red; second, when it was blue. In computing chi square we have 
combined all responses of all subjects. This constitutes a violation of the requirements 
of independence for chi square; however, the chi-square values as computed are 
except one instance far in excess of the .01 level of confidence. Moreover, the case 
for the significance of our findings lies not upon the value of chi square itself but 
upon the coherence of the emerging pattern for each of the variables examined. 


initiator decreasingly as either figure moves his head in a forward direction; 


though it is suggested that the rotation of the female figure exercises a greater 
determinant than that of the male. 

In the second elevation position (both figures equal) the pattern is less 
clear, and it probably must suffice for us to observe that the male is seen as 
the initiator when his head is upright. The pattern in Elevation 3 (male 
higher) is virtually the opposite of that obtaining in Elevation 1. Again, it 
appears that the rotation of both figures is of significant importance and that, 
as the rotation is forward, the male figure is less often seen as the initiator, 


But the position of the male figure now appears to be a more important deter- 


minant than that of the female. 

Let us now consider which silhouettes tend to evoke interpretations in- 
volving the act of “telling.” The incidence with which such ascriptions are 
made is found in Table 2. It is evident that the difference between the three 


TABLE 2 


THE INCIDENCES OF RESPONSES INVOLVING “TELLING” FOR 


EACH OF THE 27 DiApic VARIANTS* 


Male 
Elevation 1 Elevation 2 Elevation 3 
i (Female higher) (Equal) (Male higher) 

Back- Up- For- Back- Up- For- Back- Up- For- 
Female ward right ward ward right ward ward right ward 

Backward 34 43 49 37 59 50 47 59 42 

Upright 51 90 67 55 95 59 44 72 44 

Forward 40 68 45 36 SF 66 39 51 52 


ж g2 110.87. 


See footnote to Table 1. 


4 JOURNAL OF SOCIAL PSYCHOLOGY 


evation positions is relatively slight, but that in all three a conspicuously 
eightened incidence is observed when the two figures are upright, looking 
irectly forward. In this instance the null hypothesis leads one to expect 54 
1 each cell. The obtained chi square of 110 exceeds the .01 level of confidence. 


TABLE 3 
Tue INCIDENCES oF Responses INVOLVING “ASKING” FOR 
EACH OF THE 27 DIADIC VARIANTS* 


Male 
Elevation 1 Elevation 2 Elevation 3 
т (Female higher) (Equal) (Male higher) 

Back- Up- For- Back- Up- For- Back- Up- For- 
Female ward right ward ward right ward ward right ward 
lackward 42 42 38 19 41 36 66 47 38 
Jpright 36 28 38 51 25 39 46 44 — 42 
'orward 49 41 37 46 39 35 34 34 28 


* 72 = 54.59. See footnote to Table 1. 


of “ ; but when both heads are forward this 
nterpretation is infrequent. On balance 


T. 
Tue INCIDENCES or RESPONSES INVOLVING “DEMANDING” 
FOR EACH OF THE 27 Diapic Vani 


ANTS* 
Elevation 1 El i 
vation evation 2 Elevation 3 
(Female higher) (Equal) (Male higher) 

Back- Up- For- Back- Up- For- Back- Up-  For- 
Female ward right ward ward right ward ward right ward 
ackward 47 43 42 74 27 42 21 
Beteke 37 2 ч та чү 35 12 42 
rward 39 20 46 42 33 22 47 34 39 
* y? — 157.36. See footnote to Table 1. 
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opposition to that for the act of “telling.” We might expect this because 
"telling" is the most impersonal of exchanges, whereas “demanding” is the 
most personal and imperative. Here we find the hollow-center effect startlingly 
visible. Our null expectation for each cell is 33. The chi square of 157 attains 
a highly significant value. It will be observed that for all three elevation pat- 
terns the center cell shows a very low incidence (nine, 12, and 12, respec- 
tively). It is clear that this highly charged type of interpersonal transaction 
is rarely seen when the figures are both in upright rotation. On the contrary, 
this interpretation is most notably applied when the figures are gn the same 
level (each with his head thrown back), a position that is commonly inter- 
preted as mutual defiance. 

In Table 5 we consider the fourth act, "offering." It is clear that the 


TABLE 5 
Tue INCIDENCES or RESPONSES INVOLVING "OFFERING" FOR 
EACH or THE 27 DiaApic VARIANTS* 


Male 
Elevation 1 Elevation 2 Elevation 3 
(Female higher) (Equal) (Male higher) 
Back- Up- For- Back- Up- For- Back- Up- For- 
Female ward right ward ward right ward ward right ward 
Backward 27 21 21 16 22 22 14 24 33 
Upright 26 23 21 24 16 20 27 22 22 
Forward 22 20 22 25 21 27 30 32 31 


* x2 = 21.33. See footnote to Table 1. 


pattern here is far less incisive than for the components reviewed earlier. We 
would expect on the null hypothesis 23 responses per cell. The chi square 
for this table is but 21 and without statistical reliability. There is lacking 
here any manifest or significant pattern, such as observed for other com- 
ponents of the “menu.” 

Finally, as may be seen in Table 6, we have calculated the frequency of 


TABLE 6 
Tue INCIDENCES OF RESPONSES INVOLVING PosrrivE RESPONSES 
FOR EACH OF THE 27 Ютлріс VARIANTS* 


Male 
Elevation 1 Elevation 2 Elevation 3 
(Female higher) (Equal) (Male higher) 
Back- Up-  For- Back- Up- For- Back- Up- For- 
Female ward right ward ward right ward ward right ward 
Backward 59 63 76 53 77 68 76 110 114 
Upright 96 129 104 83 122 116 75 108 113 
Forward 104 126 99 67 113 121 71 99 103 


* 42 = 142.29. See footnote to Table 1. 
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positive responses for each of the 27 silhouettes independently of the initiator 
or the act involved. On the null hypothesis we would expect 94 responses 
per cell. The obtained chi square of 142 greatly exceeds the .01 level of 
confidence. 

The pattern here turns out to be entirely lucid. In Elevation 1 in which 
the female figure is higher, the incidence of positive responses is reduced when 
the head of the female figure is rotated backwards. In Elevation 2 it is 
apparent that if either figure is rotated backwards, 


the incidence is conspicu- 
ously low. In Elevation 3, it is clearly 


manifest that the backward rotation of 
the male figure (now at a higher elevation) is associated with a low incidence 


of positive responses, Thus, it is clear that the most-important determinant 
in Elevations 1 and 3 is the rotati 


ation of the figures 
the subject’s inter- 


е ic human relationships are 
l attitudes of the figures involved. Moreover, 


me rough and approximate 


incidence of male initiation is clea 
active sense. The same may be said for the incidence of p 
On the other hand, responses involving “telling” ang * 


to be relatively independent of elevation but strongl 
interaction of the two head rotations, 


a complex inter- 
ositive responses. 
‘commanding” appear 
y determined by the 


ai 
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MODES OF CONFORMITY AND CAREER SELECTION 
OF RURAL AND URBAN HIGH-SCHOOL SENIORS* 


State University of New York at Buffalo 


EucENE L. Garer AND Мпллам F. WHITE! 


A. INTRODUCTION 


The senior year of high school represents for many the transition from 
passive dependency to a state of conflict where the student is forced to con- 
sider decisions concerning career choices as well as other patterns of future 
activity. This period may merely represent the formalization of four years of 
preparation or it may become the precipitating moment for choice as the 
secondary school years close their doors. But career decisions are usually not 
made randomly. The socialization process to which the individual is exposed 
early starts to exert pressures concerning career and occupational choices he 
will eventually make and: 

«+.» The occupations among which an individual makes his final choices 
do not usually vary very much. It is not certain that they always vary 
sufficiently to form the hierarchy presupposed by the level of aspiration 
theory. For example, the hierarchy used by Svalastoga in his empirical 
study consists of occupations ranging from shoe-shine boy to ambassador. 
It is most unlikely that anybody ever has to make his decision subject 
to such an extensive range of variation. (2, pp. 240.) 


When Reisman (6, 7) hypothesized his trichotomy of character orientation 
—tradition directed, inner-directed and other-directed—he also sought its 
application in the economy wherein the individual was socialized. As pointed 
out by Littunen and Gaier (5): 


Reisman’s tradition directed personality belongs to the agrarian, pre- 
industrial economy, the inner-directed personality appears to emerge in 
the context of an expanding, industrializing economy, while the other- 
directed personality type represents the highly industrialized economy of 
abundance. . . . whereas, the inner-directed personality conforms 
to the personal ideal laid down in childhood by his parents and the older 
generation in general, the other-directed personality usually conforms to 
his peer group. By extension, an economy of abundance and his indus- 
trialization fosters larger numbers of other-directed personalities. 


* Received in the Editorial Office, Provincetown, Massachusetts, on July 6, 1964. 
Copyright, 1965, by The Journal Press. 2 
1 Now at the University of Georgia. 
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B. Tue PROBLEM 


The purpose of the present study was to examine the cultural determinants 
toward educational and occupational goals of two senior classes: 
urban setting; the other, in a rural context. In this study, we wi 


ere not as 
interested in the content of the confor. 


ming influence as we were in the mech- 
ence on decisions toward Careers and occu- 


automatic axis that the senior high-school student may move. For example, 


does the person who has been strongly conformed by inner-directed culture 
show a significant direction toward education? Has the student's character, 
substantially weighted with an other-directed environment, bcen socially set 


toward anomie or autonomy? Does other-directed culture conformity bear 
any relationship to the desire for higher education? 


C. SUBJECTS AND PROCEDURE 


of each of what they i 
ality liabilities about t 


a. Getting along with others, 
people.” 
b. Adjusting to the environment, y crisis.” 
€. Concern for people, “She is always willing to help others » 
d. Friendships. "My ability to make friends,” 
—À € 


1 See Gaier and Wambach (4) for a full treatm t : Р 
cation scheme developed. cnt of the rationale of the classifi- 


“The ability to get along О.К. with other 


“She can face an 


ny 
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Positive character traits emphasized traditional virtues and typically in- 
cluded: 


a. Striving. "His strong desire to be somebody." 
b. Character Strength. "His general good moral standards." 


Typical responses expressed in the four subcategories of the area of achieve- 
ment include the following: 
a. Intellectual. “Му ability in school subjects." 
b. Physical. “My ability in sports." 
c. Self-confidence. “I am confident that I can do the job I set out 


to do." 
d. Miscellaneous. “Because he is famous.” 


A chi-square test was applied to the totals obtained for the three major 
categories of group behavior, character traits, and achievement for the four 
groups: male urban, male rural, female urban, and female rural (shown in 
Table 1). When personality assets are considered by urban and rural males, 
they perceive themselves differently (x? = 35.21, df = 2, p < .01). They 


TABLE 1 
FREQUENCY AND PER CENT BREAKDOWN OF PERSONALITY AssETS BETWEEN URBAN 
AND RURAL HIGH-SCHOOL SENIORS* 


Self 
Male Female 
(М = 243) (N = 240) 
Urban Rural Urban Rural 
Personal asset t % ] 9 f 9 f 9 
I. Group behavior 

a. Getting along 
with others 24 18 7 6 31 24 18 16 

b. Adjust to the 
environment 5 4 2 2 5 4 5 4 
c. Concern for people 5 3 2 2 4 3 5 4 
d. Friendship 9 y — — 11 9 mm ite 
Total 43 32 11 10 51 40 28 24 

II. Character traits 
a. Striving 12 9 4 4 11 9 TENE 
b. Character strength 24 18 9 8 17 13 11 10 
Total 36 27 13 12 28 22 11 10 
III. Achievement 

a. Intellectual 13 10 11 10 13 10 — — 
b. Physical 30 23 75 68 26 20 73 66 
c. Self-confidence 11 8 — = 10 8 — as 
d. Miscellaneous <= ga = = — = — — 
Total 54 41 86 78 49 38 73 66 
Grand total 133 100 110 100 128 100 112 100 


* Boys (self), x? = 35.21, $ < .01, with 2 df. 
Girls (self), 42 = 17.84, р < .01, with 2 df. 
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TABLE 1 (continued) 
FREQUENCY AND PER CENT BREAKDOWN Or PERSONALITY ASSETS BETWEEN URBAN 
AND RURAL HIGH-SCHOOL SENIORS* 


Father 
Male Female 
(N = 222) (N = 219) 
Urban Rural Urban Rural 
Personal asset Í % f % f % f % 
I. Group behavior 
a. Getting along 
with others 22 19 10 9 26 24 16 15 
b. Adjust to the 
environment 9 8 A же 6 6 — x 
€. Concern for people 38 33 28 24 36 34 25 24 
d. Friendship ocho — S EIS = = 
Total 71 62 38 33 68 64 41 39 
II. Character traits 
a. Striving 6 5 5 4 7 7 10 10 
b. Character strength 20 17 33 29 13 12 19 18 
Total 26 2 Св 3 0 19 729 28 
ПІ. Achievement 
a. Intellectual 7 6 6 3 6 6 MES 
b. Physical 8 7 33 29 11 10 3s 33 
с. Self-confidence 2 2 = — 1 1 — =— 
d. Miscellaneous 1 1 =- = — — — — 
Total I "ag "uy z 18 17 735 733 


athers differently (x? = 21.38, 


apr? p < .01). In 


of the assets of their mothers, however, no significant 
difference emerged (y? — 4.14, df — 


Perception of themselves (x? = 17.84, 

= 13.79, df = 2, 01), and their 

mothers (y? — 13.03, df = Lig :01). Е 
Rural boys, in comparison with urban boys, 

importance to character traits, as assets of their 

22 per cent respectively). Rural bo 

with physical attributes of 


assets, Urban b 
mothers as more Sensitive to group behavior, while ry 
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TABLE 1 (continued) 
FREQUENCY AND PER CENT BREAKDOWN OF PERSONALITY ASSETS BETWEEN URBAN 
AND RURAL HIGH-SCHOOL SENIORS* 


Mother 
Male Female 
(N — 225) (N — 220) 
Urban Rural Urban Rural 
Personal asset f 9 f h f 9 f 96 
I. Group behavior 

a. Getting along 
with others 26 22 6 6 29 25 9 9 

b. Adjust to the 
environment 9 8 4 4 5 4 — — 
c. Concern for people 36 31 42 39 54 47 47 45 
4. Friendship „з ед SS 2S ee 
Total 72 62 52 49 88 76 56 54 

II. Character traits 
а. Striving 6 5 6 6 1 1 — — 
b. Character strength 20 17 28 26 2 2 17 16 
"Total 26 22 34 32 3 3 17 16 
III. Achievement 

a. Intellectual 5 4 3 3 — — = = 
b. Physical 1$ 1 19 16 23 20 32 30 
c. Self-confidence 1 1 — — 1 1 - = 
d. Miscellaneous A ж, l m жш ‚шыш тз 
Total 19 ^16 22 19 24 21 32 ^30 
Grand total 117 100 108 100 115 100 105 100 


* Boys (mothers), х2 = 4.14, p < .20, with 2 df. 
Girls (mothers), x? = 13.03, р < .01, with 1 df. 


mothers' maintaining greater character strength. Clearly distinguishing from 
rural youth, urban boys and girls reported their mothers much more con- 
cerned with "getting along with others" (22 per cent and 25 per cent and 
six per cent and nine per cent respectively) and yet much less directed toward 
character traits (17 per cent and two per cent and 26 per cent and 16 per 
cent respectively). 

One of the most striking findings contained in Table 1 is that rural girls 
placed absolutely no value on either their own intellectual assets or those of 
their fathers or their mothers. This finding increases in psychological sig- 
nificance when Table 2 (liabilities) is examined. Rural girls placed no onus 
on any intellectual inadequacy of their fathers and mothers and only a total 
value of 14 per cent as a fear of their own inadequacy. When we inspect the 
table referring to the choices for post high-school direction [see "Table 4], 
surprisingly we note that 15 per cent of rural girls plan on a four-year college 
program and 25 per cent intend a two-year business school. Since intelligence 
is received neither favorably or unfavorably, one wonders if the rural girls 
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plan posthigh-school education for marriage reasons, for Soaks — 
status, or merely for a means of avoiding rural boys (since on y two per c 

of the rural boys tested noted any plans to attend college). . А - 

'Table 1 indicates that a much higher degree of identification existe 

between urban youth and their Peer groups than for either of the two groups 
in the rural population. The Pattern is similarly exten 
of the assets of both the fathers and mothers, 
importance in the socialization process of the 
in the direction of character traits and achievement. And the same per- 
centage favors what this group of tural students “liked best” in their fathers 
and mothers. These data are overwhelmingly in accord with Reisman’s 
propositions: namely, that rural youth is 


milieu of the inner-directed culture in whic 


TABLE 2 


OWN OF PERSONALITY LIABILITIES ОЕ URBAN 
AND RURAL HIGH-SCHOOL SENIORS* 


Self 
Male Female 
(N = 245) (М = 217) 
Urban Rural rban Rural 
Personal liability Í 9 Í 9 96 f % 
I. Group behavior 
а. Fear of social 

situations 2 2 EM as 5 4 4 4 

b. Inability to make 
friends 4 жй шы 2 2 & 4 
Total б S “бср OP 3 $8 

П. Character traits 

a. Selfishness 19 15 20 17 17 15 14 14 
b. Bad temper 8 6 11 9 14 12 14 14 
c. Impatience 2 2 1 1 5 4 d] 9] 
d. Conceit—arrogance 10 8 14 12 9 8 4 4 
€. Frivolity 1 1 — — 2 2 = = 

f. General Character 
weakness 8 6 8 7 — — 10 10 
Total “4g ET “42 
III. Personal а Ре HG е a > ж 
a. Intellect inadequacy 35 28 18 15 13 11 14 14 
b. Physical inadequacy 17 14 35 29 27 96 20 18 
с. Independence 2 2 1 1 — — — — 
d. Lack of confidence 10 8 6 4 6 5 14 14 
е. Anxiety = a = = 1 1 4 4 
f. Moodiness 5 3 1 1 1 1 — — 
g. Financial difficulties 2 2 5 4 13 11 = — 
Total ^7 7g 66 54 61 54 52 50 

Grand total 125 


115 100 102 100 


* Boys (self), x? = 0.09, N.S., 1 df. 
Girls (self), x2 = 0.31, N.S., 2 df. 
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fashioned virtues, and parent-centered ideals and propositions (6). Students 
within this context would internalize the values and attitudes of their parents 
and perceive success within the boundaries of the posited conformity. 

Urban adolescents, however, would be more sensitive to group control. 
They would more easily change their opinions or those of their parents in 
order to be found more acceptable b: the “significant others” in their peer 
groups and social cliques. 

The perception of success in future goals should reflect what the group 
considers important rather than a mere identification with parental “dictum.” 
One of the most striking contrasts is the high percentage of perceived value 
(68 and 66 per cent for rural males and females respectively) placed in physical 
achievement. Approval for the perception of parental physical assets (29 
per cent rural boys, 33 per cent rural girls for fathers and 16 per cent for 


TABLE 2 (continued) 
FREQUENCY AND PER CENT BREAKDOWN OF PERSONALITY LIABILITIES OF URBAN 
AND RURAL HIGH-SCHOOL SENIORS* 


Fathers 
Male Female 
(М = 225) (М = 216) 
Urban Rural Urban Rural 
Personal liability f 9 f 9 f % f % 
1. Group behavior 
a. Fear of social 

situations 5 4 2 2 8 7 4 4 

b. Inability to make 
friends 1 1 — — — = 4 4 
Total 6 5 2 2 8 7 8 8 

II. Character traits 

a. Selfishness 17 15 13 12 25 23 18 17 
b. Bad temper 19 17 23 21 17 16 16 15 
c. Impatience 1 1 1 1 1 1 ш Е 
а. Conceit—arrogance 24 21 28 25 36 33 33 31 
е. Frivolity — — = же 1 1 lx DEL 

f. General character 
weakness 1+ 12 3 3 2 2 — m 
Total 75 66 68 62 82 76 67 63 

III. Personal 

a. Intellect inadequacy 3 3 3 3 и 9 — — 
b. Physical inadequacy 5 4 30 27 7 6 15 14 
c. Independence 10 9 5 4 1 1 3 3 
d. Lack of confidence 2 2 — — 1 1 4 4 
e. Anxiety 9 6 = = = = 9 8 
f. Moodiness 3 3 2 2 — — ==. = 
в. Financial difficulties 2 2 - = - = =- = 
Total 34 29 40 36 20 17 31 29 
Grand total 115 100 110 100 110 100 106 100 


* Boys (fathers), x2 = 1.17, N.S., 1 df. 
Girls (fathers), x? = 3.83, p < 20, 2 df. 
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TABLE 2 (continued) 
'UENCY AND PER CENT BREAKDOWN OF PERSONALITY LIABILITIES ОЕ URBAN 
юнә AND RURAL Нїсн-$сноог, SENIORs* 


Mothers 
Male Female 
(N = 228) (N = 210) 
Urban Rural Urban Rural 
Personal liability f % f % f % f 96 
I. Group Bea tor T 
- Fear of socia 

Ы bees 12 10 11 10 3 3 14 13 

b. Inability to make 
friends =] жа = aL == 
Total 12 10 11 10 4 4 14 13 

' II. Character traits 

a. Selfishness 14 12 24 21 30 29 x cd 
b. Bad temper 15 13 10 9 15 14 7 7 
с. Impatience 1 1 = _ 1 1 5 5 
4. Conceit—arrogance 12 10 8 7 34 33 26 25 
е. Frivolity — — 3 3 1 1 — — 

f. General Character 
Weakness т 8) 4 22 NES 
"Total 54 46 48 43 83 80 50 49 

II. Personal 
3. Intellect inadequacy 14 12 9 8 9 8 = = 
b. Physical inadequacy 12 11 34 29 7 6 27 26 
€. Independence 6 5 — = 1 1 — — 
d. Lack of confidence R 7 MES 1 1 — | 
е. Anxiety 4 4 эю. с жы ee 14 12 
£ oodiness 4 4 3 3 LL — — 
E. Financial difficulties 1 1 8 7 = — = <= 
Total 49 "uc шр a Is qp n 38 
Grand total 115 100 113 100 105 100 105 100 
* Boys (mothers), у2 — 0.64, N.S., 2 df, 
Girls (mothers), х2 = Н - f 


rural boys and 3 
attributes, the ru 


or their mothers, 


Of course, what the high influence of physica] Prowess and 
may actually reflect js the great emphasis given by the 
culture on physical beauty, Perhaps the 
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college degrees, or long-range career plans. This has been well documented 
in studies spanning a generation of adolescent desires and aspirations (3). 
Although the urban seniors also listed physical attributes as worthwhile indi- 
cators of their own assets (23 and 20 per cent for males and females respec- 
tively), they did indicate that their own assets were also distributed among 
factors of intelligence, self-confidence, and peer-group acceptance. 


2. Liabilities 


The 1,341 personality liabilities listed by the subjects were also classified 
into three major groups: group behavior, character traits, and personal, with 
each group further divided into subgroups adapted from Reisman’s question- 
naire (7) as modified by Gaier and Wambach (4). Group behavior, specifi- 
cally mentioned as detrimental to success, contained some of the following 
representative responses: 


a. Fear of social situations. "She's afraid to go places where there 
are crowds of people.” 

b. Inability to make friends. “Some people are hard for me to be 
friendly with.” 


The subcategories of character traits included the following: 


Selfishness. “I always want my own way.” 

Bad Temper. “He gets awful mad.” 

Impatience. “He gets upset with people.” 

Conceit and arrogance. “He thinks he is always right.” 
Frivolity. “I’m not serious enough; everything is a big joke.” 
General Character Weakness. “He’s not much good.” 


Ss Ro sa 


The personal category which provides an index of the inadequacy the indi- 
vidual perceives for himself or his parents in the challenge for a successful 
life contained the following subdivisions: 


Intellectual inadequacy. “My school marks are pretty bad.” 
Physical inadequacy. “My hips.” “My appearance.” 
Independence. "I don't like to be told what to do.” 

Lack of confidence. “He always thinks he is inferior.” 
Anxiety. “She’s always worried about something.” 
Moodiness. “Her mood changes just like that.” 

Financial difficulties. “He can’t handle money.” 


SP hoe SS 


A chi-square test was applied to the number of the dispersion among 
liabilities [see Table 2]. Only one significant difference in any area emerged: 
girls in the perception of their mothers significant at the .01 level. The girls’ 
perception of the liabilities of their mothers is heightened by the lack of sig- 
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i in the other two categories. In 1950, Stokes (9) hypothesized that 
one 9 cultural context have much more behavioral identification with 
their poc ule than boys with their fathers. What might well be na 
in our data on liabilities is an index of behavioral identification rather t an 
emotional identification. Urban females attributed 80 per cent of their 
mothers' liabilities to character traits; only eight per cent of the liabilities was 
founded on intellectual inadequacy. Were these urban girls expressing a 
behavioral identification in society but not an emotional behavior because they 
did not wish to identify with the “bad character" of their mothers? . 

It is especially noteworthy that neither urban nor rural students perceived 
lack of specific abilities and fears of group behavior as meaningful liabilities 
within their social cultures, whereas group behavior characteristics were 
significant assets in "belonging" to their respective adolescent societies. Among 
the personality liabilities, “selfishness” and “conceit-arrogance” 


were of pri- 
both schools but 


to a marked degree for 


oncerned about their ow; 
liabilities as were urban youth; rather, the 


to the physical “defects” 
The highly Prestigious va 
for recognition in adolesce. 


n physical limitations and 
Y were considerably more sensitive 
of both parents than were urban boys and girls. 
lue of physical assets, which rural youth demand 


nt society, diminished within the scale of comparable 
liabilities. Even though the boys and girls of urban culture appeared more 


concerned than rural youth about their own lack of achievement, nevertheless, 


tural youth perceived a deficiency of Personal achievement among their 
Parents as more unfavorable than their urban counterparts, 


ө vocational aspirations 
of the present subjects, 
When the plans of high-school s 


eniors are compared with 
of their parents according to urban 


the occupations 
and rural orientation, exp, 


ected significant 
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TABLE 4 
Per CENT BREAKDOWN OF POSTHIGH SCHOOL AND OCCUPATION CHOICES 
Urban Rural 
Boys Girls Boys Girls 
Choice (N=53) (N=30) (М = 44) (N = 40) 
Four-year college ` 54 43 2 15 
Technical or business school 23 20 18 25 
Specific skill, labor, trade — 9 10 10 
Chemical, general labor, sales, 
army, farming 19 22 50 15 
Don't know. Anything 
reasonable. 4 6 20 35 
Total 100 100 100 100 


differences in the two cultures emerge. Urban adolescents are considerably 
more concerned about formal schooling upon completion of high school than 
rural seniors. Whereas 54 per cent of the urban males and 43 per cent of 
urban females planned to seek admission to a four-year college, only two per 
cent of the males and 15 per cent of the females in the tural schools were con- 
sidering college. Technical or business school attendance was planned by 
23 per cent of the males in the city school and 18 per cent of the males in 
the rural community. A greater percentage of rural girls desired business- 
school courses than urban girls. Half of the rural sample of males would seek 
farming, the military service, personal jobs, or unskilled employment; but 
only 19 per cent of the urban boys stated these as work choices. The plans of 
the urban youth were more specific. Only four per cent of the urban males 
and six per cent of the urban girls were undecided, while 20 per cent of the 
rural boys had not made up their minds and 35 per cent of the girls would 
do “something that is reasonable.” 
From Table 3, we can observe that the occupation of the fathers of the 
ural group was heavily directed to farming and unskilled labor (60 per cent 
and 52 per cent respectively). Two Per cent of the city labor group had fathers 
who were unskilled laborers. As expected, urban fathers were more heavily 
engaged in professional and semiprofessional, managerial, and sales positions. 


E. Concrusion 


In line with Reisman’s assessment of the orientation of subjects according 
to an inner-directed and other-directed Personality hypothesis, the present rural 
subjects appear to follow more frequently the old-fashioned virtues the prin- 
ciples, and philosophy of their parents, By contrast, urban youth apparently 
were more directed by group trends. This is not © suggest that these rural 
youth were inner-directed isolates. While the individual may appear to 
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influenced by varying aspects of tradition, inner-directed, and other-directed 
culture, Reisman's theory holds that one dominant socializing agent pervades 
a given personality. Thus, while there is much overlapping among the cate- 
gories, cach individual lives in tune with the operative function of his culture. 

In the present study, the rural sample has emerged as more inner directed, 
with the subsequent choosing occupations and careers in accord with the occu- 


pations and careers of their parents. In a sense, they have rejected an interest 


in continuing formal education; in the same manner, their fathers and mothers 


have not fulfilled themselves in this way. Urban youth, on the contrary, per- 
ceived themselves as sensitive to group thinking and behavior. Since they are 
h their peers deem continuing education as 


an important means of mobility, these youth are more concerned with college 


education. And it does seem apparent that both peer groups and parents are 
-school seniors to advance up the educational ladder 
11 as mobility. 


living in an environment in whic 


influencing urban high 
as a basis for growth toward personal self-actualization as we 
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